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PART ONE 


Experiments in 
Psychodiagnostics 


THOUGHT DISORDER IN 
PSYCHOTIC PATIENTS 


R. W. Payne and J. H. G. Hewlett 


I. INTRODUCTION! 


IT is probable that different types of thought disorder are associated 
with different psychiatric illnesses. This study is concerned mainly with 
schizophrenic thought disorder which has been described in detail by 
a number of psychiatrists on the basis of material collected in inter- 
views. However, relatively few of these writers have elaborated a theory 
about the nature of this type of thought disorder in sufficient detail to 
enable objective tests to be derived from it. 


(a) Kretschmer's Theory 


One of the earliest psychiatrists to develop a detailed theory of schizo- 
phrenic thought disorder was Ernst Kretschmer (1936, 1951). Kretsch- 
mer accepted Bleuler's (1950) theory that a generalized mental dissocia- 
tion or disintegration was the main cause of schizophrenic thought 
disorder, and elaborated the idea considerably. Kretschmer believed 
that schizophrenia was an extreme position on a fundamental consti- 
tutional variable which he called ‘schizothymia-cyclothymia’. This 
variable was regarded as being more or less normally distributed 
throughout the general population, most people having some inter- 
mediate position on it. A number of characteristics, both physical and 
psychological, were said to be associated with this constitutional fac- 
tor. According to the theory, extremely ‘schizothymic’ people are of 

1 The research described here was made possible by a research grant from the 
governors of the Maudsley and Bethlem Royal Hospitals, 
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leptomorphic physique, and also sharea number of psychological charac- 
teristics. Extremely cyclothymic people are of pyknic physique, and share 
other psychological characteristics. Individuals who are abnormally 
'schizothymic' tend to develop schizophrenia, while individuals who are 
extremely ‘cyclothymic’ tend to develop manic-depressive psychosis. 
Presumably an individual’s position on this constitutional variable is 
largely the result of inheritance, so that only minor changes can be 
effected by the environment. 

This theory has a number of implications. One is that a person who 
is later to develop schizophrenia is from birth extremely ‘schizothymic’. 
Even if he recovers from a schizophrenic illness, he is unlikely to have 
moved far along the continuum, and will always remain a ‘schizo- 
thymic’ personality, liable to a further illness. Another implication is 
that, since schizophrenia and manic-depressive psychosis are at opposite 
ends of the continuum, mixed psychotic states, with features of both, 
would be incompatible with the theory. 

One of the main personality characteristics associated with ‘schizo- 
thymia-cyclothymia’ was held to be ‘Spaltungsfahigkeit’, or ‘dissocia- 
tion’. It too was held to be a continuous variable on which schizophrenics 
are extremely dissociated, and manic-depressives extremely non-disso- 
ciated. Kretschmer defined dissociation as the tendency for mental 
activity to occur in isolation, being uninfluenced by and isolated from 
other mental activity. Its opposite is a high degree of integration. An 
abnormal amount of dissociation could produce schizophrenic thought 
disorder, characterized by a ‘fragmentation’ of mental activity, and a 
lack of logical relationships between systems of ideas. Dissociation 
could also explain inappropriate affect, and the maintenance of delu- 
sional ideas at variance with everyday experience. 

Following an early empirical study by Van der Horst (1924), Kret- 
schmer (1928) listed a number of psychological tests which should be 
measures of the trait of ‘dissociation’. All the measures were estimates 
of an individual’s capacity for dissociating his attention, or performing 
more than one task at the same time. Some were motor, some percep- 
tual, and some were mental tasks. If dissociation were the common 
factor underlying performance on these tests, they should intercorrelate 
significantly and yield a general factor. Payne (1955) assembled a group 
of typical tests of 'Spaltungsfáhigkeit', including all the major tests that 
Kretschmer had reported on. The tests were given to 100 normal people. 
Seventeen ‘dissociation’ scores were obtained from each subject. It was 
found that none of these scores correlated significantly. That is to say, 
the matrix of correlations obtained showed only chance fluctuations 
from a mean value of zero. The implication is that the ability to disso- 
ciate one’s attention is very specific and is entirely a function of the 
tasks involved. This result appears to be fatal to Kretschmer’s theory. 
For one thing, if the ‘Spaltungsfahigkeit’ tests are uncorrelated, it is 
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unlikely that they will differentiate abnormal groups in any consistent 
fashion. Indeed, they appear not to. Brengelmann (1954) in an extensive 
review of the German literature on these tests, reports that there is no 
consistent evidence that schizophrenics, normal ‘leptosomes’, normal 
*pyknics' and manic-depressive psychotics obtain the mean scores on 
the tests demanded by the theory. In any event, Kretschmer laid great 
emphasis on the notion that this hypothetical factor accounts for large 
individual differences within the normal population. However, no such 
factor appears to exist. 

Perhaps this result is not altogether surprising, since ‘Spaltungsfahig- 
keit’, as operationally defined by Kretschmer, is a positive ability. 
People who are good at dividing their attention will have an advantage 
in a number of cognitive situations. Even if there were evidence for a 
general ability of this sort, one is in the curious position of explaining 
a thinking deficit, schizophrenic thought disorder, as the result of an 
excess of some useful intellectual capacity. 


(b) Babcock’s Theory 

Harriet Babcock (1930, 1933, 1941) advanced a different, and much 
simpler explanation of schizophrenic thought disorder. She suggested 
that in essence it is merely an extreme intellectual slowness. It is well 
known that people differ with respect to their speed of problem solving. 
Indeed Furneaux (1956) has recently demonstrated that individual dif- 
ferences in the speed of problem solving are an important determinant 
of the score obtained in a typical untimed test of general intelligence. 
Furthermore, intellectual speed is more or less uncorrelated with other 
intellectual functions. For example, fast thinkers do not necessarily 
make a larger number of errors. There is a very low correlation between 
speed and accuracy. 

Babcock argued that schizophrenic thought disorder results mainly 
from a gross retardation of intellectual speed. Schizophrenic patients 
think so slowly that in a normal interview situation, they have not had 
time to think through the answer to one question before they have been 
given the next question. This makes them appear thought disordered, 
since one method of dealing with such a handicap is merely to answer 
at random. This could even explain their relative social isolation. They 
think too slowly to engage in normal conversation, so they tend to 
become socially withdrawn. 

No attempt will be made to review the literature concerned with the 
experiments relevant to Babcock’s theory, since this has been done at 
length elsewhere (Payne, 1960). By and large, previous investigations 
have tended to support Babcock’s theory, in that schizophrenics have 
been found to be slow at problem solving. On the whole, acute or early 
schizophrenics are only moderately slow, while chronic or deteriorated 
schizophrenics are extremely slow. However, depressed patients have 
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usually been found to be just as slow as chronic schizophrenics. These 
and other findings suggest the possibility that abnormal slowness of 
thinking is not a unitary phenomenon. It may have several causes, some 
people being slow because their minds work slowly, some because they 
are unable to concentrate, being constantly distracted, and others be- 
cause they are thinking about a different problem which they believe 
to be relevant. 


(c) The Theory of ‘Concreteness’ 


Two entirely different theories of schizophrenic thought disorder have 
one thing in common. Both suggest that it is essentially due to an ab- 
normality of concept formation of some sort. However, they disagree 
about the nature of this abnormality. 

Kurt Goldstein and several other writers (Bolles and Goldstein, 1938; 
Goldstein, 1939, 1946; Hanfmann and Kasanin, 1937, 1942; Kasanin, 
1946; Kasanin and Hanfmann, 1938a, 1938b) have argued that schizo- 
phrenics are abnormally ‘concrete’. That is to say, they are unable to 
perform inductive reasoning since they are unable to make an abstract 
generalization. A number of experiments have been carried out to inves- 
tigate this hypothesis. These will not be reviewed in detail, as this has 
been done elsewhere (Payne, 1960; Payne, Matussek and George, 1959). 
Many of these experiments make use of either the Goldstein-Scheerer 
(1941) sorting tests, or the Vigotsky (1934) test to assess ‘concreteness’. 
The main fault of the studies which report schizophrenics to be abnor- 
mally ‘concrete’, lies in the criteria of ‘concreteness’ used. For example, 
Goldstein and Scheerer (1941) label a large number of responses as 
‘concrete’, including a failure to sort at all, a failure to ‘shift’ from one 
method of sorting to another, a failure to give an adequate verbal 
account of the sorting, and several unusual ways of sorting the material. 
Furthermore, if the test instructions are followed exactly, individuals 
who are initially ‘concrete’ are often not allowed to continue to sort 
the material spontaneously, in order to see whether later they will 
produce a more adequate method of sorting. Instead they are given a 
series of ‘control experiments’ aimed at assisting them with the task. 
Similarly an unusually slow performance on the Vigotsky tests would 
be labelled ‘concrete’ if the Hanfmann-Kasanin (1937) scoring method 
is used, as would an unusual generalization. 

More recent work (Fisher, 1950; Rashkis, Cushman and Landis, 
1946; Rashkis, 1947; Rapaport et al., 1945: McGaughran, 1954, 1957; 
McGaughran and Moran, 1956; Fey, 1951) has consistently suggested 
that, on sorting tests, schizophrenics tend to produce unusual generali- 
zations when compared with normals. However, they can produce just 
as many generalizations as normal people, and it seems unreasonable 
to label this behaviour ‘concrete’, thus identifying it with the inability 
to generalize at all. 
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The other main method of investigating ‘concreteness’ in schizo- 
phrenics, has been by the use of the ‘proverbs’ test. Studies by Wegrocki 
(1940), Benjamin (1946), Becker (1956) and Gorham (1956) have pro- 
duced consistent results. Schizophrenics give more ‘concrete’ interpre- 
tations to proverbs than do normals, whether the usual method of 
administration is used, or whether a multiple choice form of the test is 
used. In view of the results obtained with sorting tests, these results 
cannot be regarded as unambiguous. There is no doubt that schizo- 
phrenics tend to define words and interpret proverbs peculiarly. How- 
ever, this is not necessarily due to an inability to generalize. It is possible 
that unusual interpretations are often arbitrarily rated as ‘concrete’. 
It is also possible that, just as they tend to use words peculiarly, 
schizophrenics tend to interpret test instructions peculiarly. They may 
interpret the instructions as allowing the use of apt ‘concrete’ illustra- 
tions of the meaning the proverb has for them, as well as allowing a 
general explanation of the principle illustrated. It is not certain that in 
these studies the schizophrenics were encouraged to go on talking. 
Some abstract generalization might ultimately have been given, even if 
this were preceded by apparently ‘concrete’ statements, or apparently 
irrelevant remarks. 

Previous studies then, are at best ambiguous. There is no conclusive 
evidence that schizophrenics are more ‘concrete’ than normal, although 
there is considerable evidence that the generalizations they make tend 
to be unusual. 


(d) Cameron's Theory of ‘Overinclusion’ 

Norman Cameron (1938a, 1938b, 1939a, 1939b, 1944, 1947; Cameron 
and Magaret, 1949, 1950, 1951) hassuggested thatschizophrenic thought 
disorder is due to a quite different abnormality of concept formation. 
He believes that schizophrenics' concepts are over-generalized. Schizo- 
phrenics are unable to maintain the normal conceptual boundaries, and 
incorporate into their concepts elements, some of them personal, which 
are merely associated with the concept, but are not an essential part of 
it. Cameron used the term ‘overinclusion’ to describe this abnormality. 
Cameron reported that in working on the Vigotsky test, and a sentence 
completion test, schizophrenics were unable to preserve the ‘conceptual 
boundaries’ of the task. In solving a problem, the schizophrenics *. . . 
included such a variety of categories at one time, that the specific 
problems became too extensive and too complex for a solution to be 
reached’ (Cameron, 19392). 

A surprising number of experiments have been carried out to investi- 
gate this theory. These have been reviewed elsewhere (Payne, 1960; 
Payne, Matussek and George, 1959). The results obtained have con- 
Sistently supported the theory. For example, Moran (1953) and Epstein 
(1953) developed very similar tests, consisting of a list of words printed 
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on a sheet of paper. Following each stimulus word was a number of 
response words, including neologisms, and the word ‘none’. The subjects 
were asked to underline each response word which they regarded as an 
essential part of the concept denoted by the stimulus word. Individuals 
whose concepts were abnormally overinclusive were expected to under- 
line more words. Both investigators found that schizophrenics under- 
lined significantly more words than normals. 

White (1949), testing a matched schizoph.enic and normal group, 
asked his subjects to group fifteen cards with a word printed on each 
in any way they liked. The schizophrenics tended to form very large, 
vague categories, forming concepts such as ‘suspicion’ or ‘having to do 
with God’. 

Chapman (1956) and Chapman and Taylor (1957) report a series of 
interesting experiments which have also confirmed the theory of over- 
inclusion. They made use of card sorting tests of different types, and 
found that, when asked to sort according to a specific concept (e.g. 
‘fruit’), schizophrenics tended to include in this category similar cards 
(e.g. cards depicting vegetables) but not completely dissimilar cards. 
Normals did not do this. In another experiment, the subjects were pre- 
sented with cards containing four pictures, some of which illustrated a 
concept. The subjects were asked to sort the cards according to a con- 
cept, but to disregard all the pictures on the cards except the picture in 
the lower right-hand corner. As predicted, the schizophrenics were in- 
fluenced by these irrelevant ‘distractor’ items, whereas the normals were 
not. There was no evidence of ‘concreteness’ or inability to sort at all, 
since the number of errors made varied directly with the number of 
distractor items relevant to the sorting categories, there being no sig- 
nificant difference between normals and schizophrenics when distractor 
items of this sort were not present. 

It must be concluded that Cameron's theory of overinclusion has 
received strong support from the experiments so far carried out. 


(e) A Reformulation of the Theory of Overinclusion 


Payne, Matussek and George (1959) have suggested that it is possible 
to reformulate Cameron's theory of overinclusion in a slightly more 
general way so that a number of predictions follow from it. Concept 
formation can be regarded as largely the result of discrimination learn- 
ing. When a child first hears a word in a certain context, the word is 
associated with the entire situation (stimulus compound). As the word 
is heard again and again, only certain aspects of the stimulus compound 
are reinforced. Gradually the extraneous elements cease to evoke the 
response (the word) having become ‘inhibited’ through lack of ‘rein- 
forcement’. This ‘inhibition’ is in some sense an active process, as it 
suppresses a response which was formerly evoked by the stimulus. 
‘Overinclusive thinking’ may be the result of a disorder of the process 
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whereby ‘inhibition’ is built up to circumscribe and define the learned 
response (the word or ‘concept’). In short, it could be an extreme 
degree of ‘stimulus generalization’. 

The same theory can be expressed in different terms. All purposeful 
behaviour depends for its success on the fact that some stimuli are 
‘attended to’ and some other stimuli are ignored. It is a well-known 
fact that when concentrating on one task, normal people are quite 
unaware of most stimuli irrelevant to the task. It is as if some ‘filter 
mechanism’ cuts out or inhibits the stimuli, both internal and external, 
which are irrelevant to the task in hand, to allow the most efficient 
‘processing’ of incoming information. Overinclusive thinking might be 
only one aspect of a general breakdown of this ‘filter’ mechanism. 


II. THE PRELIMINARY INVESTIGATION OF THOUGHT DISORDER 
IN SCHIZOPHRENICS 


While many studies had been carried out to assess separately the various 
abnormalities described, no large-scale study had been conducted to 
compare measures of slowness, concreteness and overinclusion on the 
same schizophrenic and normal populations in order to determine which 
type of dysfunction is most characteristic of schizophrenia. 

Payne, Matussek and George (1959) administered a large battery of 
tests, including measures of slowness, concreteness and overinclusion to 
eighteen schizophrenic and sixteen neurotic in-patients. The groups were 
carefully matched for age, Mill Hill Vocabulary score (Raven, 1948), 
Progressive Matrices score (Raven, 1948) and Nufferno ‘Level’ score 
(Furneaux, 1956), both groups being almost exactly of average mean 
intelligence on all three tests. 

Most of the tests given have been used in the present study and will 
be described in detail later. In order to test Babcock's theory that 
abnormal intellectual slowness is the basis of schizophrenic thought dis- 
order, three tests of intellectual speed were given, Nufferno speed tests 
Al (individual, unstressed), A2 (individual stressed) and B1 (individual, 
stressed) (Furneaux, 1956). 

The schizophrenics were slower than the neurotics, but on only one 
test did the mean difference achieve the 5% level of statistical signifi- 
cance (on a ‘one tail’ test). A Nufferno 'speed-level discrepancy’ score 
also failed to differentiate the groups significantly. It was predicted that 
if schizophrenics were abnormally slow, they would tend to compensate 
for this by working at their maximum speed all the time. Thus, when 
their normal working rate (as measured by the ‘unstressed’ speed test— 
Al) was compared with their maximum rate (as measured by ‘stressed’ 
speed test A2) they should show little improvement in speed as compared 
with the neurotics. While this measure of ‘stress gain’ differentiated the 
groups as expected (i.e. schizophrenics improved less when asked to 
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work as fast as possible), the difference was only statistically significant 
at the 5% level by a ‘one tail’ test. It was also predicted that, if schizo- 
phrenics were abnormally slow thinkers, they would tend to compensate 
for this by being more persistent. However, a measure of persistence 
consisting of the two longest times spent during the Nufferno Level test 
on items which were later abandoned or solved incorrectly, failed to 
differentiate the groups significantly. It was concluded that intellectual 
slowness is not an important abnormality, fundamental to thought dis- 
order in schizophrenia, as Babcock’s theory suggests. It was suggested 
that intellectual slowness in schizophrenia might be a secondary by- 
product of ‘overinclusion’. Because they perceive ‘irrelevant’ aspects of 
the test as being necessary for a solution (e.g. their personal associations 
to the letters in the Nufferno letter-series items), schizophrenics have 
more data to think about. They must consider a larger number of pos- 
sible solutions, and this would result in a longer average solution time 
per item, with a greater variability of solution times. This would not 
necessarily lead to an increase in errors, as solutions based on irrelevant 
aspects of the material might still be rejected if they did not ‘fit’. 

Babcock believed that abnormal slowness in schizophrenia is funda- 
mental, and affects simple motor speed tests as well as more complex 
intellectual tests. To test this theory, four of Babcock and Levy’s (1940) 
simple motor speed tests were administered. These were the speed of 
writing ‘the United States of America’, the speed of writing one’s own 
name, the speed of writing a sentence from dictation, and the speed of 
performing the Babcock-Levy (1940) ‘substitution’ test. While the schizo- 
phrenics were slightly slower on all four measures, none of the differ- 
ences reached statistical significance. Again, Babcock’s theory that 
slowness is of fundamental importance, was not supported. 

Five measures of ‘concreteness’ were obtained from the two groups. 
These were ratings obtained in the traditional way from the Goldstein- 
Scheerer (1941) Colour-Form and Object-Sorting tests, and Benjamin’s 
(1946) Proverbs test, and two scores derived from a modified form of 
Feldman and Drasgow’s (1951) test for concept formation. Only one 
of these scores differentiated the groups significantly. This was the rat- 
ing of ‘concreteness’ obtained from the Goldstein-Scheerer Object- 
Sorting test. The schizophrenics were significantly more ‘concrete’. It 
was concluded that the heterogeneous nature of the test material of the 
object-sorting test favoured the production of unusual responses in the 
schizophrenic group, and that for this reason they were labelled more 
‘concrete’ when the Goldstein-Scheerer test administration and scoring 
criteria were followed. Goldstein and Scheerer (1941) have identified 
‘concreteness’ with ‘rigidity’. Two measures of ‘rigidity’ were obtained 
from the modified Feldman-Drasgow test. These were based on the 
number of times the two alternative methods of classifying the objects 
on each card were perceived. Only when a one-minute time limit per 
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card was enforced did the rigidity score differentiate the groups signi- 
ficantly, and it was pointed out that this result was almost certainly due 
to the tendency among the schizophrenics to produce unusual group- 
ings, which were not scored as correct. It was concluded that there was 
no evidence of abnormal concreteness in the schizophrenic group. 

Thirteen different measures of ‘overinclusion’ were obtained from 
the Benjamin Proverbs test, the Goldstein-Scheerer Object-Sorting 
test, an ‘Object Classification Test’, the Epstein (1953) test for ‘over- 
inclusion’, and the Leiter Partington (1950) ‘Pathways’ test. All these 
scores will be described in detail later. Seven of these scores differen- 
tiated the groups as predicted, at well beyond the 1% level of signifi- 
cance. Only four scores failed to produce significant differences, al- 
though the mean differences were in the expected direction. 

The Klein and Krech (1952) tactile ‘figural after effect’ test was given 
to the two groups. There was evidence that the schizophrenics developed 
weaker figural after effects, and might be regarded as developing less 
‘reactive inhibition’. While this result was not regarded as unambiguous, 
it was consistent with the hypothesis that ‘overinclusion’, which was 
thought to be due to a defect of some central ‘filtering’ mechanism which 
inhibits irrelevant stimuli, is associated with a general inability to 
develop cortical ‘inhibition’. 

This study suggested that ‘overinclusion’ may be a fundamental cause 
of schizophrenic thought disorder. The present study was designed to 
follow up these results. 


III. THE PURPOSE OF THE PRESENT STUDY 


The studies previously mentioned, and the initial study (Payne, Matussek 
and George, 1959) are all consistent with the hypothesis that ‘over- 
inclusion’ is an abnormality fundamental to schizophrenic thought 
disorder. However, all these studies have one weakness. It has not been 
demonstrated that abnormal ‘overinclusion’ is specific to schizophrenia. 
It might be equally frequent in depressive patients, and thus might 
better be regarded as a general characteristic of psychosis. 

A study by Eysenck (1952a, 1952b) suggests that this is a strong 
possibility. Eysenck (1952a), in attempting to find evidence for Kretsch- 
mer's typology ‘schizothymia-cyclothymia’, tested a group of a hundred 
normal soldiers, fifty schizophrenic patients, and fifty manic-depressive 
patients. They were given a large battery of tests, most of which were 
selected because it was thought that they would differentiate between 
the two psychotic groups. Twenty of the tests which differentiated best 
were factor analysed separately for the normal and the psychotic groups. 
It was demonstrated that only one factor was necessary to account for 
the differences observed between the three groups. This factor was 
labelled ‘psychoticism’. On the tests of ‘psychoticism’, normals had the 
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lowest scores, schizophrenics tended to have intermediate scores, while 
manic-depressives tended to have the most extreme scores. It had been 
expected that a second factor, independent of ‘psychoticism’, but differ- 
entiating the two psychotic groups would be obtained. However, no 
such factor could be demonstrated, and the results were consistent with 
the theory that in the population investigated, the depressives differed 
from the schizophrenics only in that they were more ‘psychotic’. 

Two later studies (Eysenck, Granger and Brengzlmann, 1957; Eysenck, 
S.B.G., 1955) investigated further tests of “psychoticism’. However, in 
these studies, the two psychotic groups were not analysed separately, 
schizophrenics and manic-depressives being combined. In both re- 
searches, a number of tests successfully differentiated between a com- 
bined psychotic group, and a normal and a neurotic group. 

The aims of the present investigation can be summarized as follows: 

(1) To determine whether ‘overinclusion’ is an abnormality specific 
to schizophrenia, or whether it is a general characteristic of psychotic 
patients. 

(2) To determine whether or not neurotic patients suffer from any 
degree of thought disorder. It is generally believed that neurosis does 
not produce thought disorder of the kinds described, hence the use of 
neurotic patients as a control group in the preliminary investigation. 

(3) To determine whether or not tests of ‘overinclusion’ intercorre- 
late significantly, and thus measure some factor in common. While the 
preliminary study suggested that they differentiate schizophrenics from 
neurotics as predicted, it is necessary to demonstrate that they have 
something in common before we can conclude that they all measure the 
same basic abnormality. It is further necessary to demonstrate that the 
common factor shared by these tests (as measured by a factor score) 
also differentiates schizophrenics from the other groups. 

(4) To determine whether general psycho-motor slowness is more 
characteristic of depressed patients than of schizophrenics. It has been 
mentioned that although Babcock thought psycho-motor slowness to 
be fundamental to schizophrenia, several studies suggest that depressed 
patients as a group are considerably slower than acute schizophrenic 
patients. The evidence so far available Suggests that there might be two 
unrelated causes of slowness in psychotic patients. It is possible that 
depressives are extremely slow at all types of task, motor, perceptual 
and mental, possibly because they are so distracted by their depressive 
ideas that they are generally retarded. Slowness in acute schizophrenics 
on the other hand, might be confined to intellectual tasks. It is possible 
that they are slow at problem solving, merely because their overinclusive 
thinking forces them to consider an unduly large number of possibilities. 
If this general formulation were correct, we would expect to find in a 


ion of normals, neurotics and psychotics, that tests of 
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‘overinclusion’ would be unrelated to tests of simple motor and per- 
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ceptual speed. Tests of reasoning speed on the other hand, would be 
related to both tests of overinclusion, and tests of motor and perceptual 
slowness, since slowness of reasoning can be caused either by general 
retardation or by ‘overinclusion’. 

(5) To investigate the relationship between the tests of ‘psychoticism’ 
used by Eysenck and his co-workers, and measures of ‘overinclusive’ 
thinking and slowness. 

(6) To replicate the^previous finding that schizophrenics are not 
abnormally ‘concrete’. 


IV. THE GROUPS TESTED 


The subjects tested consisted of twenty normal people, twenty neurotic 
patients, twenty depressive patients and twenty schizophrenics. These 
groups were matched for age, pre-illness intellectual level (as assessed by 
a vocabulary test), socio-economic status, and educational attainment. 

The twenty normal subjects were selected carefully to match the 
abnormal groups as closely as possible. They were approached indi- 
vidually, and were drawn from a very wide range of sources. Their 
backgrounds were very varied, and their occupations ranged from ‘home 
help’ to physician. Some of the occupations sampled were counter hand, 
carpenter, plumber, garage hand, engineering student, typist, secretary, 
motor mechanic, house-painter, shop assistant supervisor and general 
clerk. This wide range of occupation was necessary in order to match 
them with the patients, who were equally heterogeneous. It had initially 
been intended to match the groups for sex as well as the other variables. 
However, it proved so difficult to match the groups on five independent 
variables, that sex was allowed to go uncontrolled, as it was thought to 
be the least important factor. In the normal group, there were nine 
males and eleven females. 

The twenty schizophrenic patients consisted of ten males and ten 
females. The group was composed mainly of ‘acute’ or ‘early’ schizo- 
phrenics. Only one was regarded as chronic, and the remaining patients 
were not judged to have deteriorated markedly by the psychiatrists in 
charge. On the other hand, only typical, clear cut cases were selected. 
A number of these patients had been admitted previously with a similar 
illness. All were inpatients of the Maudsley Hospital, except for two 
cases who were inpatients of Cane Hill Hospital. Most were tested 
within two weeks to one month after admission and all were judged 
Still to be psychotic at the time of testing. These subjects were, on 
average, considerably more co-operative than a random sample of acute 
Schizophrenic patients. This was because the lengthy testing programme 
(two days) required a considerable degree of co-operation. A number 
of subjects were excluded because they were unable to co-operate suf- 
ficiently. However, several quite disturbed patients were persuaded to 
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take the entire battery. On average, the testing required four two-hour 
sessions, although some slower patients required longer than this. So 
far as possible, all the subjects were tested before they had begun treat- 
ment, and before they had been given any drugs (apart from mild 
night sedation). This proved a difficult condition to fulfil, as a large 
proportion of the psychotic inpatient population in the hospitals in- 
volved, were receiving drugs soon after admission. Only two of the 
schizophrenics tested were having drugs at the time of testing (apart 
from mild night sedation). Both were having ‘Largactil’ (50 mg. three 
times a day). No attempt was made to select any particular sub-type of 
schizophrenia. A number of the patients selected were clinically judged 
to be suffering from ‘schizophrenic thought disorder’ at the time of 
testing. However, this was not the case for a little over half the sub- 
jects. The psychiatric diagnoses which had been given at the time of 
testing were as follows: 

‘Schizophrenia’, 2; ‘Paranoid Schizophrenia’, 3; ‘Paranoid Schizo- 
phrenia with affective features’, 1; ‘Paranoid Schizophrenia’ (later re- 
diagnosed merely as ‘Paranoid State’), 1; ‘Simple Schizophrenia’, 4; 
‘Catatonic Schizophrenia’, 3; ‘Hebephrenic Schizophrenia’, 2; ‘Acute 
Relapse in chronic, hebephrenic schizophrenia’, 1; ‘Acute Schizophrenia 
following operation’, 1; ‘Schizophrenia with depressive features’, 2 (one 
of these was later re-diagnosed as ‘depressive disorder in a person with 
a marked schizoid personality’). 

The twenty patients in the depressive group consisted of seven males 
and thirteen females. It was originally hoped to make this a group of 
‘manic-depressive’ psychotics. This aim had to be abandoned because 
of the shortage of such patients in the two hospitals. Very few manic 
patients were available, and those that were, were usually too disturbed 
to co-operate in such a long testing programme. Manic-depressive 
patients were even scarcer. Patients who had been hospitalized with 
both a manic and a depressive episode were too infrequently admitted 
to allow a group to be collected within a reasonable length of time. 
(The present groups took over a year to collect.) This group in- 
cludes only two ‘manic-depressive’ cases. One patient was tested during 
a depressive episode, although she had had previous admissions 
with a manic illness. The other patient was actually tested during a 
hypomanic phase, although she had had episodes of depression. The 
remainder of the cases could be regarded as psychotic. All were selected 
as being reasonably typical of ‘endogenous depression’. However, many 
of the doctors who assisted in this selection did not themselves believe 
that ‘endogenous’ and ‘exogenous’ depressions can be regarded as 
separate categories. Rather, they believed that there is a continuum 
from the purely endogenous to the purely exogenous, most cases fall- 
ing in between. The cases selected were therefore regarded as falling 
distinctly towards the ‘endogenous’ end of the continuum, As a group 
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they were characterized by ideas of guilt and unworthiness, retardation, 
depressive delusions, a history of previous depressive attacks, and the 
absence of any clear precipitating causes. Fhe sample selected was not 
a random sample of the inpatient ‘endogenous depressive’ population. 
On average, the depressive inpatient population was considerably older 
than the other groups. Thus, in order to match the groups for age, the 
depressed patients tested were selected from the younger age range of 
this population. Like théschizophrenic patients, most were tested within 
two weeks to a month of admission. Three were inpatients at Cane Hill 
Hospital, and the remainder inpatients of the Maudsley Hospital. 
Nearly all these patients were tested prior to the onset of treatment. 
Unfortunately, this was not always possible. One patient had received 
three E.C.T. treatments prior to testing, while two others had received 
one E.C.T. treatment each. However, all were tested while still depressed. 
In spite of efforts to avoid it, six patients were being given drugs at the 
time of testing. In all but two cases, the dose was regarded as not large 
enough to affect test performance, although this judgment is subjective. 
Two were having 50 mg. of 'Largactil' three times daily, one was 
having 25 mg. of ‘Drinamyl’ twice daily and one was having 50 mg. of 
*Pacatal three times daily. The other two were having 150 mg. per day 
of ‘Pacatal’ when testing was commenced, but this was beingincreased at 
the rate of 50 mg. a day, so that on the final day of testing, they were 
having about 250 mg. daily. . . 

The twenty neurotic patients were subdivided into a group of eight 
‘hysterics’ and twelve ‘dysthymics’. The hysteric group consisted of 
six females and two males. The dysthymic group consisted of seven 
females and five males. Of the entire neurotic group, three were in- 
patients of Cane Hill Hospital, the remainder inpatients of the Maudsley 
Hospital. Nearly all the patients in this group were having fairly long- 
term psychotherapy. Most were tested between two weeks and a month 
after admission, although a few had been in hospital considerably longer. 
Most were free of drugs, apart from mild night sedation, at the time of 
testing. However, one case of ‘anxiety state’ was receiving 50 mg. of 
‘Largactil’ three times a day. One case of ‘reactive depression’ was re- 
ceiving small doses of ‘Equanil’ during the day, and a second similar 
case, small doses of Sodium Amytal during the day. The psychiatric 
diagnoses given to those classified as ‘dysthymic’ were as follows: 
"Reactive Depression’, 6; 'Obsessional Neurosis', 1; 'Obsessional neuro- 
Sis with anxiety’, 1; ‘Anxiety with depressive features’, 2; ‘Anxiety 
State’, 1; ‘Phobic Anxiety’, 1. The patients included in the 'hysteric 
group were diagnosed as follows: ‘Hysteria’, 1; ‘Hysterical reaction 
with multiple somatic symptoms’, 1; ‘Hysterical illness in hysterical 
personality’, 1; ‘Hysterical aphonia’, 1; “Fugue state’, 2; ‘Hysterical 
multiple personality’, 1; ‘Hysterical personality with psychopathic 
tendencies’, 1. 
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The data concerning the variables on which the groups were matched 
are presented in Table 1.1. 


TABLE 1.1 
Control data 
à p Schizo- 
" Normals Neurotics „| Depressives f 
Variabl FÉ ig y i hrenics 
nane (N = 20) | (N 220) | (N = 20) iN ipa 20) 
Mean 32:10 32-65 37-60 32:10 
Variance 89-47 104-68 165-74 69:26 
Age —— | — | ]- r 
range 18-51 19-55 19-65 18-48 
significance F = not significant 
Mean 10270 | 103-74 | 10135 | 101-26 
Mill Hil | Variance | 15842 | 20033 | 22573 | 157-22 
Vocabulary | - eee 
IQ range 78-126+ | 80-123 | 80-126+ | 80-126 + 
significance F — not significant 
‘ Mean 4-20 460 | 4:35 4:40 
Occupational| ^" al |ÀÀÀ ——— 
status Variance 2:06 1:94 | 2:98 2:46 
rating range 1-7 1-7 1-7 1-7 
scale -j nan 
significance F = not significant 
Mean 10:95 10-85 10:45 10-80 
Years of - € 
schooling Variance 10:47 3.32 4:89 3:37 
and range 8-22 9-16 8-17 9-14 
training — xm xi = : 
significance F — not significant 


In spite of the attempt to match the groups exactly for age, the mean 
age of the depressive group is still slightly higher. However, an ‘F’ test 
suggests that overall mean differences in age are not significant. 

The main aim of the matching was to ensure that the groups were 
similar in pre-illness general intellectual and educational levels. It is 
clearly not desirable to match the groups on a measure of general intel- 
ligence obtained during their illness, since thought disorder can reason- 
ably be expected to reduce efficiency on many cognitive tests, and this 
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would be controlling a variable which we wish to examine. There is 
considerable evidence (Payne, 1960) that vocabulary tests scored in the 
conventional way are least affected by mental illness. Thus, while this 
is not a perfect estimate of pre-illness ‘general intelligence’, it is prob- 
ably the best available. As can be seen the groups are almost exactly 
matched on the Mill Hill Vocabulary test (Raven, 1948). The ‘IQ’ 
scores quoted are based on a simple transformation from Raven’s 
published percentile nórms. The ‘IQ’ units in this case are made to 
have a mean of 100 and a standard deviation of 16 for each age-group. 
The Mill Hill Vocabulary test was chosen as it provides a reliable score, 
being based on both a ‘definitions’ and a ‘multiple choice’ vocabulary 
test and is standardized on an English population. 

The occupational status was assessed on Belson’s (1955) seven-point 
rating scale. The mean occupational status for each group is very close 
to the British average. The range in the present group is a trifle re- 
stricted, in that there are no representatives of either of the extremes 
of income level. This is inevitable with such small groups, as it is clearly 
not possible in a representative sample of twenty cases to include indi- 
viduals who occur only once in 1000 of the general population. In 
terms of the range of occupations sampled, and the variance on this 
rating scale, the groups can be regarded as reasonably representative 
of the British population in terms of occupational status. . 

The groups are well matched in terms of years of formal education. 
This measure includes years of university, teachers? training college, 
technical, or other full-time education in addition to years spent at 
School. 


V. THE TESTS USED AND THE MEAN DIFFERENCES OBTAINED 


A complete list of the measures derived from the battery of tests 
administered is given in Table 1.2, along with the code number used to 
denote each variable. These code numbers will be used in subsequent 
tables and diagrams. (The code numbers were used to tabulate the 
control data as well. Thus the first test code number is 9.) 


TABLE 1.2 
List of variables with code numbers 
Code 
Number Variable 


9 Mill Hill Vocabulary IQ. 
10  Brengelmann Picture Recognition test score. 
11 Object Classification test number of ‘A’ responses. 
12 Object Classification test number of ‘non A’ responses. 
13 Total time on Object Classification test. 
14 Average response time for Object Classification Test. 
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List of variables with code numbers (continued) 


Variable 
Average time per ‘A’ response for Object Classification Test. 
Luchins test: Number of simple solutions after set. 
Luchins test: Number of difficult solutiors after set. 
Luchins test: difficult solution to ‘control problem’ before set = 1, 
simple = 0. 
Epstein test, total time taken. 
Epstein test, Overinclusion score. 
Epstein test, Neologism score. 
Goldstein-Scheerer Colour-Form test ‘concreteness’ rating. 
Goldstein-Scheerer Object-Sorting test ‘concreteness’ rating. 
Goldstein-Scheerer Object-Sorting test ‘Overinclusion’ score. — 
Goldstein-Scheerer Object-Sorting test, Number of unusual sortings. 
Waves test: Average amplitude. 
Waves test: Average wavelength. 
Dynamometer test: maximum grip. 
Static ataxia: number of reversals. 
Static ataxia: total movement. 
Shaw test: Number of ‘A’ responses. 
Shaw test: Number of ‘B’ responses. 
Shaw test: Number of ‘C’ responses. 
Shaw test: Number of ‘D’ responses. 
Shaw test: total time. 
Shaw test: Average response time. 
Nufferno Level test score. 
Sum of two longest ‘incorrect’ or ‘abandoned’ item times on Level 
test. 
Nufferno Speed test A1, unstressed. 
Nufferno Speed test B1 unstressed. 
Nufferno Speed test A2 stressed. 
Word Association test: number of synonyms. 
Word Association test: repetition of stimulus word. 
Word Association test: number of multiple responses, 
Word Association test: number of aside remarks, 
Word association test: repetition of previous stimulus or response 
words. 
Babcock test: speed of writing ‘U.S.A.’ 
Babcock test: speed of writing sentence. 
Babcock test: speed of writing name. 
(Note: scores 47, 48 and 49 were combined, and are referred to as 
variable 47 in the factor analysis plots). 
Babcock ‘Substitution’ test average time per line, 
Time to draw three squares. 
Wechsler Arithmetic sub test. 
Wechsler Picture Arrangement sub test. 
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TABLE 1.2 
List of variables with code numbers (continued) 
Code 
Number Variable 


54  Disproportionality score, drawing design test. 
55 Average time: drawing design test. 

56 USES combined manual and finger dexterity score. 
57 Proverbs test ‘A’ score. 

58 Average number of words per proverb. 

59 Average reaction time per proverb. 

60 Average total time per proverb. 

61 Pathway I: time required. 

62 Pathway I: number of errors. 

63 Pathway II: time required. 

64 Pathway II: number of errors. 

65 Shaw test: Average ‘A + B' response time. 
66 Shaw test: Total number of responses. 

67 Shaw test: Average time per ‘A’ response. 

68 Static ataxia: maximum backwards sway. 


(a) Tests of Intellectual Speed 

It was pointed out in the introduction that Babcock (1930, 1933, 1941) 
regarded intellectual slowness as a primary cause of schizophrenic 
thought disorder. However, previous studies have suggested that de- 
pressed patients are even slower than acute or early schizophrenics. It 
is possible that these two groups are slow for different reasons. Depres- 
sives may be abnormally slow at all tasks, perhaps because they are 
constantly distracted by their depressive thoughts, whatever they are 
doing. Schizophrenic slowness on the other hand might be confined 
to intellectual tasks, because it is a by-product of overinclusive thinking. 
In the present study, intellectual speed was measured by the Nufferno 
Speed tests (Furneaux, 1956). These tests consist of Thurstone-type 
letter series items, the subject being required to write down the next 
letter of the series. Normal people, when solving a letter series problem, 
consider only a limited range of possible solutions. Indeed they regard 
only the sequence of the letters as being relevant to the task. It is 
possible that schizophrenics do not exclude other aspects of the letter 
Series items in considering possible solutions. For example, they might 
also take into account their associations to the letters. This in itself 
Would not necessarily produce an incorrect answer, since solutions in- 
Corporating irrelevant aspects of the material might nevertheless be 
Tejected if they did not fit. It would, however, produce on the average 
a slower solution time per item, since more information would have to 
be considered for each item. It should also lead to an increased variability 
of solution times. 

19 


R. W. Payne and J. H. G. Hewlett 


In the present investigation, both the Nufferno ‘Level’ test, and the 
Nufferno speed tests were used. The Level test makes use of precisely 
the same type of items as do the speed tests, namely Thurstone-type 
letter series items. In this test, each item is presented individually on a 
card. The subject is encouraged to take all the time he needs. The test 
items do not get progressively more difficult, but are arranged in cycles 
of seven items each, starting with easy, and ending with difficult items. 
The test is arranged so that each individual is given only items around 
the ‘threshold’ of his level of failure. Items he would always solve cor- 
rectly or would always fail (as determined by performance on the first 
cycle) are not given. Thus the ‘experience of failure (and presumably the 
accompanying discouragement) is controlled for all subjects. Normally 
the test is untimed, although in the present study the solution time for 
each level item was timed secretly (by the use of a wall clock with 
sweep second hand behind the subject). Only the first three cycles of 
the Level test were used. 

The raw scores obtained from this test were transformed linearly for 
ease of computation, as shown in Table 1.3. 


TABLE 1.3 
Level Test raw score transformations 


Raw Score Coded Score 
0-64 

65-129 
130-194 
195-259 
260-324 
325-389 
390-454 
455-519 
520-584 
585-649 


w000-10v th i 00 t2 — O 


The results for the five groups are presented in Table 1.4. 


TABLE 1.4 
Nufferno Level Test coded scores (Variable 37) 


Normals  Dysthymics  Hysterics Depressives Schizophrenics 


Number of cases — 20 12 8 20 20 
Mean 6:05 5:08 5-13 415 5:05 
Variance 542 1:72 1:55 6:34 5:94 
Significance F — 1:85, not significant 


It is interesting that, although the groups were deliberately not 
matched on a test of general intelligence, on the grounds that thought 
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disorder might well reduce the scores, the Nufferno Level test does not 
differentiate the groups significantly. 

Three forms of the Nufferno speed tests (Furneaux, 1956) were used 
in the present study. Test Al consists of a group of easy items of homo- 
genous difficulty level. It was given individually under ‘unstressed’ 
conditions. That is, the subjects were told to work at their own rate, 
and they were unaware that they were being timed. This was followed 
by test B1, which is similar to Al, but consists of a homogeneous set of 
more difficult items. It was also given individually under ‘unstressed’ 
conditions. The third test given, test A2, is of the same level of difficulty 
as test Al. However, it was given individually under ‘stressed’ condi- 
tions. That is to say, the subjects were told to work as quickly as 
possible, and were aware that they were being timed. 


The scores obtained from all three speed tests are ‘mean log time’ p S 
Scores (i.e. the average of the logarithm of the solution time for each 8 g 
item solved correctly, excluding the average time required merely to zZ: 


write down the answer). The mean log time scores obtained from these 
tests were transformed linearly for ease of computation as shown in 
Table 1.5. 


meag 0 


TABLE 1.5 
Nufferno Speed Tests, transformations used 
Test A1 Test B1 Test A2 : 
Coded Coded : 
Mean log time poe Mean log time Score Mean log time Score 
0-8000-0-9230 0 0:9900-1-1280 0 0-7630-0-8730 0 
0:923 1-1-0460 1 1:1281-1:2660 1 0-8731-0-9830 1 
1:0461—1-1690 2 1:2661-1:4040 2 0:9831-1:0930 2 
1:1691-1-2920 3 1:4041-1:5420 3 1:0931-1-2030 3 
1:2921-1:4150 4 1:5421-1:6800 4 1-2031-1:3130 4 
1:4151-1:5380 5 1:6801-1:8180 5 1:3131-1:4230 5 
1:5381-1-6610 6 1-8181-1:9560 6 1:4231-1:5330 6 
1:6611-1-7840 7 1:9561-2:0940 T 1:5331-1:6430 7 
1-7841-1-9070 8 2-0941-2:2320 8 1:6431-1:7530 8 
1:9071-2-0300 9 2-2321-2:3700 9 1:7531-1:8630 9 


The results obtained are presented in Tables 1.6, 1.7 and 1.8. 


TABLE 1.6 
Nufferno Speed Test Al (individual, unstressed) coded scores (Variable 39) 


Normals Dysthymics —Hysterics Depressives Schizophrenics 


Number of cases — 20 12 8 
zm 2:30 
Variance 3-06 
Significance 
by A 
sl 
NET 
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TABLE 1.7 
Nufferno Speed Test B1 (individual, unstressed) coded scores (Variable 40) 


Normals  Dysthymics  Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 19 18 
Mean 1-95 2:67 2-00 3-68 3-33 
Variance 1:94 1:88 4-00 3-34 1:88 
Significance F = 402, p = 0:01 

TABLE 1.8 


Nufferno Speed Test A2 (individual, stressed) coded scores (Variable 41) 


Normals  Dysthymics  Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 19 18 
Mean 1:80 2:50 2:25 3-84 3:50 
Variance 2:17 2:27 3.36 4:92 5-09 
Significance F = 3-58, p < 0:05 


These results are as expected. As in previous studies (Payne, 1960) it 
is the depressives who are slowest on all three tests, although the 
schizophrenics are slower than the neurotics and normals, (The various 
group comparisons which might have been made using ‘t’ tests have 
not been attempted, as this is not strictly permissible statistically.) 

Generally speaking, we might expect individuals who are unusually 
slow at problem-solving to tend to compensate for this by becoming 


TABLE 1.9 
Level Test, transformation applied to persistence scores 
Score in Transformed 
Seconds Score 
51-79 0 
80-123 1 
124-191 2 
192-297 3 
298-462 4 
463-717 5 
718-1114 6 
1115-1729 7 
1730-2685 8 
2686-4168 9 


more persistent. This might reasonably be expected of the schizophrenic 

group, if their slowness is the result of overinclusive thinking. With so 

many more data to consider in order to solve a problem, the over- 

inclusive thinker would have to become more persistent than average, 

if he were to reach any solution at all. However, this need not be the 
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case with the depressives. If they are slow because their worries distract 
them, they are likely also to become distracted from the task altogether, 
and thus give up if they cannot get an answer in a reasonable length of 
time. In order to examine these expectations, a measure of persistence 
was obtained from the Level test. This was the sum of the two longest 
times (in seconds) spent during the Level test on items which were sub- 
sequently abandoned or solved incorrectly. The distributions on this 
measure were markedly'skewed for all the groups, as is generally the 
case with untransformed time scores. In order to keep the variances for 
the groups equal as nearly as possible, and because it was intended to 
intercorrelate all the measures, a log transformation was applied to 
normalize these distributions. The transformation is shown in Table 1.9. 
The results for this score are given in Table 1.10. 


TABLE 1.10 
Nufferno Level Test persistence scores (transformed) (Variable 38) 


Normals  Dysthymics Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 3-90 3.92 3:25 3-50 4:30 
Variance 3:46 4-63 2:50 4:05 5:38 
Significance F = 0:57, not significant 


As expected, the schizophrenics have the highest mean persistence 
scores. However, the differences are not statistically significant. 


(b) Tests of Motor Speed 

It has been suggested that acute schizophrenics are slow only at intel- 
lectual tasks, because they are overinclusive. This was the case in the 
preliminary study (Payne, Matussek and George, 1959). On the other 
hand, depressives were expected to be slow at all types of activity. A 
number of simple motor speed tests were given to test this hypothesis. 
The tests used, all sub-tests from the Babcock-Levy (1940) test, were 
as follows: 


(1) The speed of writing ‘the United States of America’. The score 
was merely the number of seconds required (Variable 47). 

(2) The speed of writing one’s own name. The score was the number 
of seconds required (Variable 49). 

(3) The speed of writing a simple sentence from dictation. The sen- 
tence used, from Babcock’s battery (item 12 MC) was ‘I hope 
to leave here very soon’. 

(4) The Babcock-Levy ‘substitution’ test (test 7 in Babcock’s battery). 
This test merely requires the subject to write the appropriate 
number (from I to 5) in a symbol, using a key provided. The test 
would appear to be a function of simple perceptual and motor 
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speed, although some rote learning is probably also involved. 
Babcock's scores were not used. The score used in the present 
study was merely the average time in seconds required to com- 
plete each line of the test, there being five lines in all. 


In order to normalize the distributions, and make the variances as 
homogeneous as possible, logarithmic transformations were used as 
shown in Table 1.11. 

TABLE 1.11 
Transformations applied to Babcock Motor Speed Test raw scores 


Time to Write ‘United States of 


America' (Variable 47) Babcock Sentence (Variable 48) 
Time in Transformed Time in Transformed 
Seconds Score Seconds Score 

6-7 0 5-6 0 
8-10 1 7-8 1 
11-13 2 9-10 2 
14-16 3 11-12 3 
17-22 4 13-16 4 
23-28 5 17-20 5 
29-37 6 21-25 6 
38-48 7 26-32 4 
49-62 8 33-41 8 
63-80 9 42-52 9 

Speed of writing name Babcock ‘Substitution’ Test 

(Variable 49) (Variable 50) 
Time in Transformed Average time Transformed 
Seconds Score per line Score 

3 0 9-89-11-64 0 

4 1 11:65-13:71 1 

5 2 13:72-16:14 2 

6 3 16:15-19:00 3 

7-8 4 19-01-22:39 4 

9-11 5 22:40-26:36 5 
12-18 6 26:37-31:05 6 
19-24 T 31:06-36:56 7 
25-31 8 36:57-43:95 8 
32-40 9 43-96-50-70 9 


The results obtained are presented in Tables 1.12, 1.13, 1.14 and 1.15. 

As expected, on average the depressives were the slowest of all the 

groups. Quite contrary to prediction, however, the schizophrenics are 

clearly slower than the neurotics or the controls on all four tests. These 

results are not consistent with those obtained in the preliminary study 

(Payne, Matussek and George, 1959), and are not consistent with the 
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TABLE 1.12 
Speed of writing ‘United States of America’: transformed scores (Variable 47) 


Normals Dysthymics  Hysterics Depressives Schizophrenics 


Number of cases 20 12 T 20 20 
Mean 1-35 3-00 2:14 3-00 2:95 
Variance 1:29 2:00 0:48 3-37 5:21 
Significance si F = 343, p < 0:05 

TABLE 1.13 


Speed of writing sentence: transformed scores (Variable 48) 


Normals  Dysthymics  Hysterics Depressives Schizophrenics 


Numter of cases 20 12 T 20 20 
Mean 2:25 3-42 314 3-65 3-30 
Variance 1:04 0:99 0-81 2:24 3-59 
Significance F = 2:83, p < 0:05 

TABLE 1.14 


Speed of writing name: transformed scores (Variable 49) 


Normals  Dysthymics —Hysterics Depressives Schizophrenics 


Number of cases 20 12 7 20 20 
Mean 245 425 4:00 4:35 4:95 
Variance 2:26 2:75 1:33 2:56 3:52 
Significance F = 651, p < 0-01 

TABLE 1.15 


Babcock ‘Substitution’ Test: transformed scores (Variable 50) 


Normals  Dysthymics  Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 225 2-75 2:38 4:55 3:90 
Variance 1:25 2:02 0-55 4:05 4:94 
Significance F — 603, p « 0:01 


hypothesis that slowness in acute schizophrenia is confined only to 
intellectual tests. Indeed these simple motor tests produce the same 
results as the Nufferno speed tests. Two possibilities are suggested. 
Schizophrenics may suffer from two quite unrelated abnormalities, a 
Beneral retardation which they share with depressives, and 'over- 
inclusion’ which they do not share with depressives. If this is the case, 
intellectual slowness in schizophrenia cannot be entirely attributed to 
overinclusion. The other possibility is that overinclusive thinking 
affects the performance even of simple motor speed tests. This seems less 
likely. Even if it did, it should affect problem-solving more than simpler 
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tasks. Yet there is no evidence that the two sorts of speed test are in any 
way different, in terms of their capacity to differentiate the groups. 


(c) Tests of ‘Concreteness’ 


(1) The Goldstein-Scheerer Sorting Tests. Previous studies suggest that 
schizophrenics are ‘concrete’ in their performance on these tests, if 
Goldstein and Scheerer’s (1941) scoring criteria are used. If the tests 
are scored in this way, however, the result will be ambiguous. Goldstein 
and Scheerer do not distinguish between what might be called a ‘bizarre’ 
sorting, and a failure to sort at all. Both performances are rated as 
‘concrete’. If the theory of ‘overinclusion’ is correct, and schizophrenics 
perceive more aspects of the test material as relevant to the task than 
do normals, they will have more data on which to base possible sortings. 
If they base their sorting on what normal people would regard as irrele- 
vant aspects of the material, the sorting will be judged to be bizarre, 
or at least unusual. However, if the schizophrenic does not distinguish 
between a ‘bizarre’ sorting and a normal, ‘abstract’ sorting, which type 
of response occurs first will be completely random. If Goldstein's 
method of administration is followed, and the initial solution is an 
unusual one (e.g. ‘patterning’), the subject’s performance will be rated 
as ‘concrete’, Furthermore, the subsequent test procedure will be altered, 
and in many cases the subject will not be allowed to continue to sort 
the material so that it will not be possible to determine whether or not 
he would later produce spontaneously an ‘abstract’ solution. 

It was predicted from the theory of overinclusion that, if Goldstein 
and Scheerer's scoring system were followed, the schizophrenics would 
tend to be ‘concrete’ on these tests, by virtue of their ‘overinclusive’ 
tendencies. (This was the case in the preliminary study.) However, these 
tests should not differentiate as well as tests specifically designed to 
measure ‘overinclusive’ thinking. There seemed no reason to predict that 
the depressives would be different from the normal and neurotic groups. 

Two tests of the Goldstein-Scheerer battery were used, the Colour- 
Form test and the Object-Sorting test. These were administered in ac- 
cordance with the instructions in the manual. Although the authors 
give a detailed account of the types of responses on these tests which 
they label ‘concrete’, they do not give a system of scoring them. A five- 
point rating scale was therefore used for each test. A score of ‘O 
indicates a completely ‘abstract’ performance while a score of ‘4’ indi- 
cates a completely ‘concrete’ performance. These rating scales were very 
carefully constructed on the basis of the criteria laid down in the 
manual, and each level of performance was defined as precisely as pos- 
sible. For example, rating ‘2’ on the Object-Sorting Test was defined 
as follows: “This score is given for subjects who require experiment IV, 
that is, subjects who fail to shift in experiment III (shifting), or who 
can think of only one acceptable alternative method of sorting (using 
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the "acceptable" methods listed in the manual) in experiment III. 
However, this score is only given if the subject then succeeds with experi- 
ment IV (coercive shifting). Success in experiment IV is defined as 
explaining correctly (and in ‘‘abstract’’ terms) the first four groupings 
presented by the examiner, i.e. “all metal", “all red", “all toys" and 
“all double occurrences". 

The results obtained with these two tests are presented in Tables 
1.16 and 1.17. ? 


TABLE 1.16 
‘Concreteness’ ratings fron: the Goldstein-Scheerer Colour-Form Test 
(Variable 22) 
Normals  Dysthymics  Hysterics Depressives Schizophrenics 
Number of cases 20 12 8 20 20 
Mean 0:50 0:83 0:88 1:05 2:35 
Variance 1:05 1:42 212 1:42 2:33 
Significance F = 4:62, p < 0:01 
TABLE 1.17 
“Concreteness’ ratings from the Goldstein-Scheerer Object-Sorting Test 
(Variable 23) 
Normals  Dysthymics  Hysterics — Depressives Schizophrenics 
Number of cases 20 12 8 20 20 
Mean 1:75 2:25 2:00 2:10 2:80 
Variance 0-62 0:75 0:29 1:25 112 
Significance F — 331, p « 0:05 


These results are as expected. The depressives mean scores are very 
similar to those of the neurotic and normal groups. On the other hand, 
the schizophrenics are more ‘concrete’ than the other groups on both 
tests. In the preliminary study (Payne, Matussek and George, 1959), 
only the Object-Sorting test ‘concreteness’ rating differentiated signi- 
ficantly between a neurotic and a schizophrenic group. It was thought 
that this was because the more heterogeneous material of the Object- 
Sorting test favoured the production of *overinclusive' responses which 
tended to be rated as ‘concrete’ when the Goldstein-Scheerer scoring 
Criteria were used. In the present study, the Colour-Form test differen- 
tiated slightly better than the Object-Sorting test. Almost certainly the 
high ‘concreteness’ ratings in the schizophrenic group on both tests 
are related to the production of unusual responses, and not to the 
inability to make abstract generalizations. The evidence that this is 
the case will be discussed later. 

(2) The Benjamin Proverbs Test. It has been argued that although 
Schizophrenics are consistently reported as being ‘concrete’ in their 
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explanations of proverbs, these findings could also be interpreted as 
being due to ‘overinclusive’ thinking. Benjamin’s proverbs test was 
given to the present groups in order to demonstrate that, when scored 
in the traditional way, the schizophrenic group will be moderately 
‘concrete’ but that this score will not differentiate as well as direct 
measures of ‘overinclusive’ thinking. Furthermore, this ‘concreteness’ 
rating should be significantly correlated with other tests of ‘overinclu- 
sion’. It was not expected that the depressive group would be abnormal 
on this test. 

Benjamin’s (1946) list of fourteen proverbs was used. The instructions 
used in the present study were as follows: ‘I am going to read a few 
proverbs, and what I want you to do is simply give the meaning of each 
one as simply as you can. Let me give you an example. “Too many 
cooks spoil the broth." This means that when too many people are con- 
cerned in a task, they are likely to be less efficient than a single person 
would be. Now I am going to read some more proverbs and I want 
you to explain what they mean. Also tell me when you have finished.’ 
(The proverbs were then read one at a time and the answers recorded 
verbatim, no other instructions being given.) 

The answer to each proverb was merely rated ‘abstract’ or ‘con- 
crete’ according to Benjamin's general criteria. The score was the total 
number of abstract answers given. If, however, no answer at all was 
given, that particular proverb was not counted (e.g. some subjects 
said they had never heard of some of the proverbs, and thus gave no 
answer. These were excluded.) When proverbs were omitted in this way, 
the score obtained was pro-rated so as to be a proportion of 14 (the 
highest possible score). For computational purposes, this score was 
transformed linearly as shown in Table 1.18. 


TABLE 1.18 
Transformation applied to the ‘Abstraction’ Scores from Benjamin Proverbs 
Test 
Number of Transformed 
Abstract Answers Score 
0,1 0 
2,3 1 
4,5 2 
6,7 3 
8,9 4 
10, 11 5 
12, 13 6 


The results obtained are presented in Table 1.19. 
These results are as expected. While the depressive group is a little 
less abstract than the normal and neurotic groups, it is probably not 
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TABLE 1.19 


‘Abstraction’ rating from Benjamin's Proverbs Test: transformed scores 
(Variable 57) 


Normals Dysthmyics  Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 3-40 3-17 4-00 2:75 2:25 
Variance 1:73 9 1:24 0:86 1:78 2:51 
Significance F = 3:36, p < 0:05 


significantly different (i.e. the F ratio would not be significant were the 
schizophrenic group excluded). However, the schizophrenics on average 
are more ‘concrete’ than the other groups. 

Evidence that this score does not imply an inability to generalize, but 
is related to ‘overinclusion’ will be presented later. 


(d) Tests of ‘Overinclusion’ 

(1) Measures Derived from Benjamin's Proverbs Test. If the concepts 
formed by schizophrenics are abnormally extensive, it is possible that, 
to them, a proverb means more than it does to normal people. Thus 
the concept it illustrates should be more difficult to define as it should 
be more complicated. One measure of overinclusive thinking would 
be the number of words required to explain each proverb. In the 
administration of this test, it should be noted that no time limit was 
imposed. The subjects were instructed to tell the examiner when they 
had finished, and what they said was recorded until they gave this 
Positive indication that they had finished. The score used was merely 
the average number of words per answer. In order to normalize the 
distributions, and keep the variances for the groups as constant as 
Possible, a log transformation was made as shown in Table 1,20. 

The results are given in Table 1.21. 


TABLE 1.20 
Transformation applied to average number of words per proverb 
Average Number Transformed 
of Words Score 
3-98-5-62 0 
5:63-7:94 1 
7-95-11-22 2 
11:23-15:84 3 
15-85-22-39 4 
22:40-31:62 5 
31:63-44-67 6 
44-68-63.10 7 
63-11-89-13 8 
89-14-126-00 9 
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TABLE 1.21 
Average number of words per proverb: transformed scores (Variable 58) 


Normals  Dysthymics  Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 2:50 317 4:25 3-00 4:45 
Variance 0-68 0-70 2:21 1-37 5:31 
Significance F = 552, p < 0-01 


As expected, the schizophrenics have the highest mean score. The 
depressives, on the other hand, are very close to the normal group. On 
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FIG. 1.1. Average No. of Words Per Proverb. 


this measure at least, there is no evidence that the depressives are abnor- 

mally overinclusive. This measure, while producing mean differences 

in the expected direction, had not differentiated significantly between 

schizophrenics and neurotics in the preliminary study (Payne, Matussek 
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and George, 1959). It was pointed out in that study, that it is extremely 
difficult, when copying down patients’ answers, to avoid paraphrasing 
them and condensing them. This tends to occur more with people who 
speak at length. In the present study, considerable care was taken to 
avoid this difficulty. If the examiner was unable to keep up with a 
patient’s response while recording verbatim, a count of the number of 
words used was kept by putting a dot on the record form for each word 
missed. An interesting feature in these results, is that the hysterics, 
unlike the normals, dysthymics or depressives, use nearly as many 
words in their answers as do the schizophrenics. This suggests that this 
score may not be a pure measure of ‘overinclusion’. Extraverted people, 
particularly those judged to have ‘hysterical personalities’ tend to talk 
more than average, and this would increase their score. Unfortunately, 
no other measure of extraversion was included in the present battery 
of tests, so this hypothesis cannot be tested on the present data. The 
histograms illustrating the performance of each group on this score, 
are presented in Fig. 1.1. Note that the scores are not transformed, but 
are expressed as the average number of words used. ] 

If overinclusive subjects are explaining a more extensive and difficult 
concept when interpreting proverbs, in addition to requiring more 
words, they should need longer to prepare what they are going to say. 


TABLE 1.22 
Transformations applied to time scores: Proverbs Test 


Mean Reaction Time Mean Total Time 


Average Time Transformed Average Time Transformed 
(seconds) Score (seconds) Score 
2:14-3-05 0 6:38-8:51 0 
3:06-4:37 1 8:52-11:35 1 
4:38-6:24 2 11:36-15:14 2 
6:25-8:91 3 15:15-20:18 3 
8:92-12:74 4 20:19-26:92 4 
12:75-18:20 5 26:93-35:89 5 
18:21-26:00 6 3590-47-86 6 
26:01-37:15 7 47-87-63-83 7 
37-16-53-09 8 63-84-85-11 8 
53-10-75-86 9 85-12-113-50 9 


That is, they should have an increased ‘reaction time’, if this is defined 
as the time (in seconds) elapsing between the presentation of the proverb 
by the examiner, and the first word spoken by the subject in answering. 
Their explanation itself should also take longer. Thus, two further 
Measures of ‘overinclusion’ used, were the ‘average reaction time’ for 
each proverb and the ‘average total time’ (in seconds) required to give 
the answer. Since both scores are time measures, however, they will not 
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be pure measures of ‘overinclusion’. People who are merely slow, will 
tend to have abnormal scores, just as will people who are slow because 
they are overinclusive. It was expected that these two scores would 
measure ‘slowness’ and ‘overinclusion’ equally. Thus, both depressives 
and schizophrenics should tend to be abnormal on these two measures. 

In order to normalize the distributions, and keep the variances equal 
for each group as far as possible, log transformations were used for 
these two scores, as shown in Table 1.22. 9 

The results are presented in Tables 1.23 and 1.24. 


TABLE 1.23 
Mean reaction time for Proverbs Test: transformed scores (Variable 59) 


Normals  Dysthymics —Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 2:55 3:50 2.13 3-95 3.95 
Variance 3-00 3:73 1:84 4:16 2-50 
Significance F = 3:08, p < 0:05 

TABLE 1.24 


Mean total time for proverbs: transformed scores (Variable 60) 


Normals  Dysthymics — Hysterics Depressives Schizophrenics 


Number of cases — 20 12 8 20 20 
Mean 1-80 3:17 2-50 3:35 474 
Variance 1:75 415 0:86 2:87 2:76 
Significance F = 8-64, p < 0-01 


These results are more or less as expected. On the reaction time score, 
the depressives and schizophrenics are equally slow, the other groups all 
being faster. Schizophrenics are slowest in average total ‘explanation’ 
time, while depressives are less slow, although slower than the other 
groups. Presumably while retardation in the depressive group produces 
slowness on this measure, a longer answer (due to ‘overinclusion’) 
produces even greater slowness. 

It is worth commenting on the scores produced by the two neurotic 
groups. Although the hysterics use more words, they are quicker in 
giving their answer than are the dysthymics. Again these differences may 
reflect personality characteristics, dysthymics tending to be slower, more 
thoughtful and more concise, hysterics tending to be faster and more 
verbose. These differences are compatible with what is known of the 
cognitive differences between these two groups (Payne, 1960). It empha- 
sizes the possibility that the individual measures of overinclusion used 
in this study may measure specific factors in addition to ‘overinclusion’ 
(cf. chapter by Slater). 

(2) Measures Derived from the Goldstein Object-Sorting Test. It has 
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already been argued that schizophrenics obtain their ‘concreteness’ 
rating on this test, not because they are unable to make an abstract 
generalization, but because they tend to produce unusual methods of 
sorting the objects. The number of unusual methods of sorting should 
provide a measure of ‘overinclusion’. This measure was derived from 
the ‘sorting’ section of the test (experiments II and III in the Goldstein- 
Scheerer (1941) manual). An ‘unusual’ sorting was defined as any 
method of grouping the objects not listed by Goldstein and Scheerer 
when they describe the various ‘abstract’ ways in which the objects can 
be grouped. 
The results obtained are presented in Table 1.25. 


TABLE 1.25 


Number of unusual sortings—Goldstein-Scheerer Object-Sorting Test 
(Variable 25) 


Normals  Dysthymics Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 0-45 1-00 1-25 0:40 125 
Variance 0:26 0:91 1:07 0:57 1:46 
Significance F = 3:64, p < 0-01 


As expected, the schizophrenics produce more unusual responses 
than do the normals. There is no evidence that the depressives are over- 
inclusive on this measure, as their mean score is nearly identical with 
the normals. The hysterics unexpectedly produce as many unusual 
methods of sorting as do the schizophrenics. This result is specific to 
this test. Several other measures of unusual types of sorting were ob- 
tained, and on none of the others do the hysterics obtain extreme scores. 
This could be a chance finding. In any event, no explanation can be 
offered. . : 

During the first stage of the administration of the Object-Sorting 
test (the *Handing-Over' experiment—experiment I), the subjects are 
asked to select one of the objects on the table. They are then asked to 
hand over to the examiner all the objects which might be grouped 
together with this object. As each object (including the first) is handed 
Over, it is placed out of sight by the examiner. This experiment thus 
requires the subject to build up a concept around the initial object. If 
Schizophrenics tend to form overinclusive concepts, they should select 
significantly more objects as belonging together during this part of the 
experiment. . 

Although Goldstein and Scheerer allow the subjects to select their 
Own objects as starting points in this stage of the experiment, Payne, 
Matussek and George (1959) standardized this aspect of the test to 
Some extent. Their procedure was used in the present experiment. The 

E.P. II—D 33 


R. W. Payne and J. H. G. Hewlett 


‘handing-over’ experiment is performed four times. On the first trial, 
the subject was allowed to select his own object. For the remaining 
three trials the examiner selected the object which was to be the ‘point 
of departure’. The objects were always the red plate, the box of matches 
and the bicycle bell in that order. All the subjects were given the ‘male’ 
set of objects described in the manual, regardless of sex. The object- 
sorting test material used in the present investigation is shown in Plate I. 
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FIG. 1.2. Overinclusion Score. 


The overinclusion score used was merely the average number of 
objects grouped together with the four objects used as ‘points of depar- 
ture’. Fig. 1.2 shows the histograms for the groups for the untransformed 
scores. 

In order to normalize the distributions, and equate the group vari- 
ances as far as possible, a log transformation was applied to this score 
as shown in Table 1.26. The results are presented in Table 1.27. 
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TABLE 1.26 


Transformation applied to Overinclusion Scores: Goldstein-Scheerer 
Object-Sorting Test 


Transformed 
Score Score 
2:23-2:91 0 
2:92-3:69 1 
3-70-4-92 2 
4:93-6:39 3 
6:40-8:31 4 
8:32-10:81 5 
10:82-14:06 6 
14:07-18:28 7 
18:29-23:76 8 
23-77-31-00 9 


TABLE 1.27 


‘Overinclusion’ scores from the Goldstein-Scheerer Object-Sorting Test: 
transformed scores (Variable 24) 


Normals Dysthymics Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 

Mean 2:10 233 2.88 205 3-75 
Variance 1:78 1:51 242 1:00 4-62 
Significance F = 425, p < 001 


These results are as expected. The schizophrenics have the highest 
mean overinclusion score. On the other hand, the depressives obtain a 
mean score almost identical with that of the normal group. 

(3) The Object Classification Test. This test was originally designed 
to provide an easier and more objective method of measuring *concrete- 
ness' in sorting behaviour, as defined by Goldstein (Payne, 1954). The 
aim was to incorporate all the features of the three Goldstein-Scheerer 
Sorting tests in one test. 

The test consists of twelve small objects: four triangles, four squares 
and four circles (see Plate IT). The objects vary in weight, thickness, size, 
material, and the hue and brightness of the colours which they are 
painted. There are intended to be ten different ways of grouping the 
objects as follows: (1) by shape (three groups, squares, circles and 
triangles), (2) by size (two groups, some are large and some are small), 
(3) by thickness (two groups, half the objects are twice as thick as the 
others), (4) by surface area (six groups, those of the same shape and 
Surface size have an identical surface area), (5) by presence or absence 
of curved edges or corners (two groups), (6) by weight (at least three 
Broups. This sorting must be independent of density), (7) by material 
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(three groups, some are metal, some wood, and some a light plastic. 
This sorting is allowed if the subject is unable to distinguish between 
the wood and plastic objects, making only two groups), (8) by colour 
(eight groups, those of precisely the same colour are grouped together), 
(9) by hue (four groups, red, blue, green and yellow), (10) by saturation 
of colour (two groups, some colours are highly saturated and ‘bright’ 
looking, some are unsaturated and ‘dull’ looking). 

The instructions used with the test are very simple. They are as fol- 
lows: ‘Do you see these objects on the table? I want you to sort them 
out into groups. Put them in some sort of order. Put those together 
which you think belong together.’ (When the subject has produced one 
of the 10 sortings, say:) ‘Good. Why did you do it that way? (If the 
subject’s explanation is correct, proceed with the test. Note that the 
explanation need not be completely ‘abstract’ in Goldstein’s sense, just 
sufficiently clear to be sure the subject has really perceived that particu- 
lar sorting method. If the subject cannot explain his grouping, or if he 
cannot produce one of the ten methods of grouping, the examiner 
demonstrates the ‘shape’ sorting, but does not subsequently credit it. 
Otherwise the initial sorting is credited. The final instructions are as 
follows :) ‘Now there are other ways of sorting this material. I want you 
to sort the objects in as many other simple ways as you can. Arrange the 
objects into groups as you have just done, so that there is a single 
simple reason for each grouping. Each time you sort them, I want you 
to tell me why you have sorted them in that way.’ (No further instruc- 
tions are given at all, but each explanation is recorded. There is no time 
limit. The first time the subject spontaneously says he can do no mcre, 
he is encouraged to continue. The test is ended when he gives up a 
second time. All the responses are timed secretly. This was done in the 
present experiment by means of a wall clock with sweep second hand 
behind the subject. The time for each response, including the initial one, 
is taken from the time the subject has been told to sort the objects, or, 
during the main part of the test, from the time he has finished his last 
explanation. The time taken for him to explain his method of sorting 
to the examiner is not included, since presumably he is not thinking of 


TABLE 1.28 
Object Classification Test: number of ‘A’ responses (Variable 11) 


Normals | Dysthymics Hysterics ^ Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 4:20 4-00 3:38 3-15 2:80 
Variance 2:59 2:73 2:84 4:24 542 
Significance F — 176, not significant 


a new method of sorting while giving this explanation. Thus each re- 
sponse time starts when he begins to sort, and ends when he begins his 
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explanation to the examiner. The ‘total time’ score is merely the sum 
of these response times.) 

If it is true that schizophrenics are not ‘concrete’, but are ‘overinclu- 
sive’, a number of predictions can be made concerning their sorting 
behaviour on this test. Since schizophrenics will perceive more aspects 
of the material as being relevant to the task than will normal people, 
they will be able to elaborate many more methods of sorting. These 


FIG. 1.3. Object Classification Test ‘non-A’ responses. 


extra methods of sorting would not be considered by normal people 
because they ignore the aspects of the material on which they are based 
(c.g. the shadows cast by the objects on the table, slight scratches on 
the objects, personal associations to the objects and so on). Schizo- 
Phrenics should, therefore, take longer to do the test because they 
think of more solutions. They should not distinguish between ‘normal’ 
and ‘unusual’ solutions, and should, therefore, produce both types of 
37 


R. W. Payne and J. H. G. Hewlett 


sorting in a random order. If there were no time limit for the test, there 
should be no reason why they should not produce as many ‘abstract’ or 
‘normal’ methods of sorting as normal people do, although these wouid 
be randomly interspersed among the other solutions. In practice, of 
course, all the subjects will tend to give up the test after working on it 
for a period of time, so that it would not be surprising to discover that, 
while schizophrenics produce many more solutions, they nevertheless 
produce slightly fewer ‘abstract’ or ‘usual’ solutions than normal people 
before they give up. 

A ‘usual’ or correct response was defined as one of the ten groupings 
intended in the construction of the test. It was denoted as an ‘A’ 
response. Table 1.28 gives the average number of ‘A’ responses obtained 
by the five groups. 

This score could be taken as a measure of the ability to ‘abstract’. 
As can be seen, the groups are not significantly different, although the 
schizophrenics have a slightly lower mean score, as expected. There is 
thus no evidence on this test, that schizophrenics are abnormally 
‘concrete’. 

The ‘unusual’ methods of sorting this test material were called 
‘non-A’ responses. A ‘non-A’ response was defined as any response 
not included in the list of ten ‘A’ responses, or a repetition of a pre- 
viously given ‘A’ response. The ‘non-A’ response score should clearly be 
a measure of ‘overinclusion’. The number of ‘non-A’ responses given by 
the groups is shown in the histograms in Fig. 1.3. 

The ‘non-A’ response score was transformed for ease of computation 


as shown in Table 1.29. 
TABLE 1.29 


Transformation applied to number of ‘non-A’ responses. 
Object Classification Test 


No. of ‘non A’ Transformed 
Responses Score 
0-2 0 
3-5 1 
6-8 2 
9-11 3 
12-14 4 
15-17 5 
18-20 6 
21-23 ji 


The results for this measure are given in Table 1.30. 

As expected, it is the schizophrenics who are extremely overinclusive 
on this measure. The depressives obtain a mean score very close to 
those obtained by the neurotics and the normals. Again there is no 
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TABLE 1.30 


Object Classification Test: ‘non-A’ responses: transformed scores 
(Variable 12) 


Normals  Dysthymics  Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 0:20 0-33 0-13 0-60 2:05 
Variance 027 ' 024 0:24 2:46 3:63 
Significance F = 685, p < 0:01 


evidence of abnormal overinclusion in the depressive group. It is clear 
from these results that, as expected, schizophrenics can obtain many 
more sortings than normals on this test; a normal number of ‘usual’ 
or ‘abstract’ sortings, as well as a large number of ‘unusual’ responses 
based on aspects of the material ignored by normal people. Space does 
not permit a list of all the types of unusual sortings obtained from this 
group. They were by and large what was predicted. For example, one 
man classified the objects in accordance with the type of shadows they 
cast on the table. As the sun moved around during the course of the 
test, he was able to produce a second ‘shadow’ classification at the end 
of the test, as the shadows had changed. Normal people assume that 
this is a two-dimensional test. That is to say, the groups must be laid out 
on the table. However, if a third dimension is introduced, many more 
groupings are possible. Some schizophrenics utilized this additional 
complication by forming their groups into piles of a certain order. The 


TABLE 1.31 
Transformation applied to total time, Object Classification Test 
Total Time Transformed 
(seconds) Score 
31-48 0 
49-75 1 
76-115 2 
116-178 3 
179-275 4 
276-424 5 
425-654 6 
655-1009 7 
1010-1556 8 
1557-2399 9 


condition of the surface of the objects, and various types of associations 

to the objects were also used to produce additional methods of sorting. 

As predicted, the schizophrenics tended to produce usual and unusual 

Tesponses in a random order, although no objective measure of this 
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has been used. When unusual responses were given by the other groups, 
it tended to be at the end of the test, just before they gave up. 

Several time scores were derived from this test. Overinclusive sub- 
jects, because they produce more classifications and have more ‘data’ 
to think about, should spend a longer total time on the test. However, 
the ‘total time’ score will not be a pure measure of overinclusion, Sub- 
jects who are merely slow will also have high scores. It was therefore 
expected that this score would measure both overinclusion and general 
retardation. Depressives as well as schizophrenics should tend to obtain 
extreme scores. In order to normalize the distributions, and keep the 
group variances relatively homogeneous, the log transformation shown 
in Table 1.31 was applied to this score. 

The results obtained are presented in Table 1.32. 


TABLE 1.32 
Object Classification Test Total Time: transformed scores (Variable 13) 


Normals Dysthymics  Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 3-95 442 3-50 4-45 5:95 
Variance 1-84 1:72 2:57 3.84 3:00 
Significance F = 515, p < 001 


The schizophrenics have the most extreme scores on this measure. 
As expected, depressives are also slower than normals, although the 


TABLE 1.33 


Transformation applied to average time per ‘A’ Response—Object 
Classification Test 


Average Time 


per ‘A’ Response Transformed 

(seconds) Score 
11-15 0 
16-23 1 
24-37 2 
38-63 B 
64-127 4 
128-296 5 
297-1000 6 
1001-8000 4 
8000 + 8 


difference is small. Presumably the number of responses given is the 
main determinant of this score. 
Thesecond timescore derived was the ‘Average time per “A”’ response’. 
This was merely the total time taken, divided by the number of ‘A’ 
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responses given. Individuals with a high proportion of ‘non-A’ responses 
would tend to have high scores, so that this score should be a function 
of ‘overinclusion’. However, retarded subjects should also tend to have 
high scores merely because they are slow. In order to normalize the dis- 
tributions, and roughly equate the group variances, the log transforma- 
tion shown in Table 1.33 was applied. 
The results obtained for this score are presented in Table 1.34. 
e 


TABLE 1.34 


Object Classification Test average time per ‘A’ response: Transformed scores 
(Variable 15) 


Normals  Dysthymics  Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 3-15 3-50 3.13 4-25 5:35 
Variance 1:61 1:00 1:27 3:57 3-29 
Significance F = 635, p < 0:01 


The results are as expected. Schizophrenics have the largest mean 
Score, presumably because of their large number of ‘non-A’ responses. 
Depressives have a larger score than the other groups, presumably 
because they are slow. 

The third time score derived was the average response time. This 
Score was based on all types of sorting, and was merely the rate at which 


TABLE 1.35 


Transformation applied to average response time scores, 
Object Classification Test 


Average 
Response Time Transformed 

(seconds) Score 
13-16 0 
17-21 1 
22-28 2 
29-37 3 
38-48 4 
49-63 5 
64-83 6 
84-109 d 

110-142 8 

143-186 9 


Iesponses were produced. If schizophrenics consider more data than do 

normals, the test for them should be more difficult in the sense that it is 

more complex. For each sorting method perceived, more aspects of the 

material would have to be considered, both in isolating the principle 
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and in allocating the objects to the groups. When more types of sorting 
are produced, it should also take more time to recall whether or not 
this method had already been given previously. Thus, overinclusive 
thinking should produce an increased average response time. This 
measure would, of course, also be a direct measure of retardation for 
those who are slow but not overinclusive. In order to normalize the 
distributions and roughly to equate the group variances, a log trans- 
formation was applied as shown in Table 1.35: 
The results obtained are presented in Table 1.36. 


TABLE 1.36 


Object Classification Test average response time: Transformed scores 
(Variable 14) 


Normals  Dysthymics  Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 3-95 4:67 3-75 5:15 5:35 
Variance 2:79 2-61 3-07 3:27 3:17 
Significance F = 2:56, p < 0:05 


As expected, the depressives and schizophrenics are nearly equally 
slow on this measure. It is interesting that again, the hysterics are 
slightly quicker than the dysthymics, a common finding in the literature 
(Payne, 1960). 

(4) The Shaw Test. The Shaw Test is a test of concept formation very 
similar to the Object Classification Test, and it was included to replicate 
the findings on the Object Classification Test with slightly different test 
material. The set of blocks comprising the test material is illustrated in 
Plate III. The blocks were originally constructed by Mr. Harold Shaw 
of the University of Toronto. Howson (1948) used them in an investiga- 
tion of intellectual impairment associated with brain damage. Bromley 
(1956) used them in a study of the effect of age on creative intellectual 
output, and Bromley’s method of administration and scoring were used 
in the present study. 

The material consists of four wooden blocks varying in colour, height, 
shape, size and weight. There are four shapes: hexagon, pentagon, 
square and triangle. There are four degrees of the colour grey, each 
block becoming progressively lighter than the dark grey of the hexagon. 
In addition, the blocks have other features by which they may be 
graded in a number of sequences. On the bottom of each block is one 
of four names: rat, lamb, ox and whale. The size of the printing varies 
from block to block. On the top surface, each block has one of four 
letters: A, B, C and D, and there is a hole in the centre of this surface 
varying in diameter and depth from block to block. Along the side of 
each block there is a notch varying in width, depth and position relative 
to the corner of the block. 
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The test instructions used were as follows: ‘Would you pick up each 
of these blocks and examine them? You will see that they can be ar- 
ranged in a kind of logical order from left to right, that is to say in a 
sequence or series.’ (Pause.) ‘There are a number of ways of arranging 
them. Can you find one way to begin the test?’ (If the subject was 
unable to arrange the blocks in an acceptable manner, the ‘ABCD’ 
sequence was demonstrated, and the subject asked to repeat this. If 
the subject could not find a further sequence, height was demonstrated, 
and the subject requested to repeat this order. The subject was then 
asked to find as many other arrangements as possible. No time limit 
was used, the criterion for ending the test being the same as in the 
Object Classification test. The same timing procedure was also followed, 
to obtain individual response times, and from these, the total time.) 

There are intended to be 15 ‘correct’ or ‘abstract’ ways of ordering 
the test material. These were scored as ‘A’ responses, and consist of the 
following arrangements: (1) ABCD—by the letters, (2) Height, (3) 
Weight, (4) Colour, (5) Diameter of hole, (6) Depth of hole, (7) Number 
of sides, (8) Length of sides, (9) Depth of notch, (10) Length of notch, 
(11) Position of notch, (12) Alphabetical order of animals (from the 
names printed on the underside), (13) Number of letters in name of 
animal, (14) Size of letters in name of animal, (15) Size of the animal 
referred to. 

Instead of scoring all the other possible responses together as ‘non-A’ 
responses, as in the case of the Object Classification test, Bromley has 
produced a scoring system for classifying them. This was followed in 
the present study. A ‘B’ response is usually a response involving some 
combination of the more fundamental variables. Bromley’s complete 
list will not be given here, but some examples of ‘B’ responses are as 
follows: (1) gross size of notch, (2) reversals of ‘A’ sequences, (3) 
sequences based on very fine details, c.g. thickness of the ink in the let- 
tering, (4) volume of the blocks, (5) gross number of edges, surfaces 
or corners, (6) importance of the animals to man. 

In general *C' responses according to Bromley's list are a group of 
less adequate responses. Examples from the list he gives are as follows: 
(1) grouping instead of serializing, (2) making anagrams with the single 
letters or attempting to make sentences with the words, (3) inane re- 
Sponses, (4) strength or docility of animals, etc. Many responses in this 
category involve the subject's associations to the objects. 

D' responses are less adequate still. A ‘D’ response is scored for each 
demonstration required at the beginning of the test. It also includes such 
things as describing or naming the blocks, attempting to fit them to- 
gether, and building towers. 

The same predictions were made for this test as were made for the 
Object Classification Test, the rationale being identical. The ‘A’ re- 
sponses can be regarded as ‘usual’, ‘abstract’ responses. Theoretically, 
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given unlimited time, schizophrenics should produce as many ‘A? re- 
sponses as normal people. However, the Shaw test differs from the 
Object Classification test in being more complex. Thus it provides more 
scope for a large number of unusual responses. If this occurred, the 
test could become so long that many overinclusive subjects would be 
likely to tire, and give up the test before they had exhausted all the 
possibilities it had for them. This would have the effect of reducing the 
number of ‘A’ responses they gave, if, as expected, from the theory, 
responses of all sorts are given in a random order by overinclusive 
subjects. 

The ‘A’ response score was transformed linearly for ease of compu- 
tation as shown in Table 1.37. 


TABLE 1.37 
Transformation applied to ‘A’ response scores, Shaw Test 
Number of Transformed 
‘A’ Responses Score 
0,1 0 
TME 1 
4,5 2 
6,7 3 
8,9 4 
10, 11 5 
12; 13 6 
14, 15 7 


The results obtained for this score are presented in Table 1.38. 


TABLE 1.38 
Shaw Test ‘A’ Responses: transformed scores (Variable 31) 


Normals  Dysthymics  Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 3-80 3-00 3-38 2:30 2:55 
Variance 4-80 2:73 1:98 4:01 5:10 
Significance F = 1-68, not significant 


In spite of the fact that the schizophrenics might have been expected 
to give fewer ‘A’ responses, the differences between the groups are insig- 
nificant. Again, there is no evidence that they are abnormally ‘concrete’ 
in the sense of being unable to produce a normal, abstract generalization. 

In general, it was predicted that schizophrenics should produce more 
of all the other types of responses than normals, because they are over- 
inclusive. This prediction cannot be made for the ‘B’ responses how- 
ever. Bromley introduced the category of ‘B’ responses to cover re- 
sponses which are usual in normal people, but from a logical point of 
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view are slightly inferior. They are not responses based on unusual 
aspects of the material. Thus, from the point of view of the present 
study, they can be regarded as identical with ‘A’ responses. The ‘B’ 
response score was transformed linearly for ease of computation, as 
shown in Table 1.39. 


K TABLE 1.39 
Transformation applied to ‘B’ response score: Shaw Test 
Number of ‘B’ Transformed 
Responses Scores 
0,1 0 
2,3 1 
4,5 2 
6,7 3 
8,9 4 
10, 11 5 
12; 13 6 


The results obtained for this score are presented in Table 1.40. 


TABLE 1.40 


Shaw Test ‘B’ responses: transformed scores (Variable 32) 


Normals  Dysthymics — Hysterics Depressives Schizophrenics 


Number of cases — 20 12 8 20 20 
Mean 0-15 0:17 0:00 0-05 0-55 
Variance 0-13 0-18 0-00 0-05 2.05 
Significance F = 1-25, not significant 


The number of ‘B’ responses do not differentiate the groups signi- 
ficantly. Again, there is no evidence that schizophrenics are unable to 
Produce normal generalizations. 

The ‘C’ responses in Bromley’s scoring system are mainly comprised 
of responses based on ‘irrelevant’ aspects of the test material, or per- 
sonal associations to the blocks. This score should, therefore, be a good 


TABLE 1.41 
Transformation applied to ‘C’ response score, Shaw Test 
Number of 'C' Transformed 
Responses Score 

0 0 
1-5 1 
6-39 2 
40+ 3 
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measure of overinclusion. The results obtained with this score are 
shown in Fig. 1.4. in the form of histograms. 


01232458 7 B. B: 10 
FIG. 1.4. Shaw Test. Number of C Responses. 


Because of the very deviant scores obtained by some of the schizo- 
phrenics, a fourth root transformation was needed to produce approxi- 
mately normal distributions on this measure. The scores were trans- 
formed as shown in Table 1.41. 


The transformed scores for the five groups are given in Table 1.42. 
TABLE 1.42 
Shaw Test ‘C’ responses: transformed scores (Variable 33) 


Normals Dysthymics  Hysterics — Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 0-60 0-58 0-75 0:70 1-25 
Variance 0:47 0:27 0:21 0:33 0-51 
Significance F = 3:87, p < 0:01 
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As predicted, the schizophrenics have a higher mean ‘C’ response score 
than the other groups. Again there is no evidence that the depressives 
are overinclusive, as their mean ‘C’ response score is similar to those of 
the other groups. 

The ‘D’ responses on the Shaw test, are not, strictly speaking, all 
‘responses’. This score is given when the subject needs a demonstration 
at the beginning of the test. It is also scored when subjects attempt to 
fit the blocks together,'or merely build a tower. Thus ‘D’ responses can 
be scored when the subject fails to produce any method of ordering 
the blocks at all. In that sense, it is also a measure of ‘concreteness’. 
The results obtained for this score are presented in Table 1.43. 


TABLE 1.43 
Shaw Test number of ‘D’ responses (Variable 34) 


Normals  Dysthymics  Hysterics — Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 0-40 0:33 0-63 1:00 1:50 
Variance 0-67 0:79 113 2:53 437 
Significance F — 1:93, not significant 


The number of ‘D’ responses did not differentiate the groups sig- 
nificantly, although the schizophrenics obtained the highest mean 
Scores, not because of their failure to respond, but because of some of 
their unusual responses. A 

It was expected from the theory of overinclusion, that schizophrenics 
should produce more responses of all sorts on the Shaw test, disregard- 
ing categories. The total number of responses was not normally dis- 
tributed, so that the cube root transformation shown in Table 1.44 
was applied. 

The results for this score are given in Table 1.45. 


TABLE 1.44 
Transformation used with total number of responses, Shaw Test 
Total Number Transformed 
of Responses Score 
1,.2 0 
3,4 1 
5-7 2 
8-12 3 
13-18 4 
19-27 5 
28-37 6 
38-49 T 
50-64 8 
65-82 9 
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Shaw Test total number of responses: transformed scores (Variable 66) 


Normals  Dysthymics  Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 3-20 2-15 2.88 2:40 3:55 
Variance 1-01 1-48 0-48 1-62 479 
Significance F = 1-75, not significant 


While the mzan scores for the groups are in the expected positions, 
the differences overall are not statistically significant. This may be due 
to the length and complexity of the Shaw test. It may be that, when 
normal people give up the Shaw test, they have more or less exhausted 
the possibilities it has for them. However, the test takes so long, that 
overinclusive subjects tend to give up the test long before they have 
produced all the methods of classification they might think of. Thus 
they do not produce many more responses than do the normals, largely 
because they tire of the test. If this were the case, we should expect to 
find that the groups all tended to spend a similar length of time on the 
test. This information is available in terms of the ‘total time’ score. The 
log transformation shown in Table 1.46 was applied to this score in 
order to normalize the distributions. 

The results are presented in Table 1.47. 


TABLE 1.46 
Transformation used with total time, Shaw Test 
Total Time Transformed 

(seconds) Score 
29-46 0 
41-15 1 
76-121 2 

122-196 3 

197-317 4 

318-512 5 

513-827 6 

828-1336 7 

1337-2157 8 
2158-3483 9 
TABLE 1.47 


Shaw Test total time: transformed scores (Variable 35) 


Normals  Dysthymics Hysterics — Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 4:60 425 4:63 5-00 5-70 
Variance 2:25 3-30 1-70 2:63 4:54 
Significance F — 1-69, not significant 
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As expected, the groups do not differ significantly in terms of the 
total time spent on this test. Thus, while both the total number of 
responses and the total time should theoretically be partly a function of 
overinclusion, they may not be very valid measures when derived from 
the Shaw test, because of its length. 

The second time score derived from the Shaw test was the average 
time per ‘A’ response. It was merely the total time, divided by the 
number of ‘A’ responses. Overinclusive individuals who have a higher 
proportion of *non-A' responses, should obtain high time scores. It was 
thus expected that the schizophrenics would have high scores. However, 
à general retardation could also produce an increased 'A' response time, 
so that this score should not be a pure measure. Depressives should 
also tend to have high scores. The reciprocal cube root transformation 
Shown in Table 1.48 was applied to this score to normalize the 
distributions. 


TABLE 1.48 
Transformation applied to average time per ‘A’ response, Shaw Test 
Average Time Transformed 
(seconds) Score 

10-13 0 
14-18 1 
19-28 2 
29-44 3 
45-77 4 
78-150 5 
151-356 6 
357-1203 7 
1204-9633 8 
9634+ 9 


The results are presented in Table 1.49. 


TABLE 1.49 
Shaw Test average time per ‘A’ response: transformed scores (Variable 67) 


Normals  Dysthymics Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 3-60 3-75 3-75 5-00 535 
Variance 2-78 0:57 1-07 232 2:03 
Significance F = 583, p < 0:01 


As expected, the schizophrenics have the most extreme scores, pre- 
Sumably because of their higher proportion of ‘non-A’ responses. De- 
Pressives have higher scores than the neurotics and normals presumably 

&cause of their general slowness. 
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As has been mentioned, on the Shaw test, ‘A’ and ‘B’ scores are 
nearly identical in the sense that both are ‘usual’ responses. Thus, the 
third time score examined has precisely the same rationale as the 
"Average time per “A” Response’. It is the ‘average time per “A” or 
“B” response’ and is merely the total time divided by the total number 
of ‘A’ + ‘B’ responses for each subject. The log transformation shown 
in Table 1.50 was applied to this score to normalize the distributions. 


TABLE 1.50 
Transformation applied to average time per ‘A’ or ‘B’ response, Shaw Test 


Average Time Transformed 
(seconds) Score 
9-16 
17-29 
30-49 
50-85 
86-146 
147-250 
251-429 
430-736 
737-1261 
1262-2162 


v0 100 RUN-O 


The results for this score are presented in Table 1.51. 


TABLE 1.51 


Shaw Test average time for ‘A’ or ‘B’ responses: transformed scores 
(Variable 65) 


Normals  Dysthymics  Hysterics — Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 2:35 2-42 2-50 3-75 4:25 
Variance 2:13 0-81 0-86 2:62 3:57 
Significance F = 5:84, p < 0:01 


As expected, this score produces exactly the same mean differences 
as did the average time per ‘A’ response. 

The fourth time score derived from the Shaw Test was the average 
response time, responses of all sorts being taken into account. It was 
expected that on this measure, retarded people should have the most 
extreme scores, because of their general slowness. On the other hand, 
overinclusive subjects should be slower than normal, because for them 
the test is more complex, with many more aspects to consider each time 
a sorting is performed. The log transformation shown in Table 1.52 
was applied to this score to xr 3 the distributions. 
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TABLE 1.52 
Transformation applied to Shaw Test, average response time 
Average Time Transformed 
(seconds) Score 
10-13 0 
14-19 1 
20-21. 2 
28-38 3 
39-54 4 
55-76 5 
71-107 6 
108-151 7 
152-212 8 
213-298 9 


The results are presented in Table 1.53. 


TABLE 1.53 
Shaw Test average response time: transformed scores (Variable 36) 


Normals  Dysthymics — Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 3-15 3:25 3-75 4:65 4:50 
Variance 1:50 1:48 1:64 2:45 3-42 
Significance F = 3:89, p < 0:01 


As expected, it is the depressives who have the most extreme scores 
on this measure, followed by the schizophrenics. 

(5) The Epstein Test of Overinclusive Concept Formation. This test 
was developed by Epstein (1953) as a direct measure of ‘overinclusion’ 
in concept formation as defined by Cameron. The test consists of a list 
of fifty words printed on two pages. Following each stimulus word there 
are six response words, including the word ‘none’. The subject is asked 
to underline all those response words which are an essential part of the 
Concept denoted by the stimulus word. Epstein predicted from Cameron’s 
theory that schizophrenics would underline more response words than 
normals, as their concepts would be more overinclusive. This was in 
fact the case in his initial study. Schizophrenics had a mean overinclu- 
Sion score of 20-92, while normals had a mean score of only 12:49. 
Payne and Hirst (1957) tested a small group of eleven endogenous 
depressives and fourteen matched normals. They found the normals to 
have a similar overinclusion score to Epstein’s group (13:79). However, 
the depressives were significantly more overinclusive obtaining a mean 
Score of 37-91, considerably higher than Epstein's schizophrenic group. 
Payne and Hirst concluded that the Epstein test might well be measuring 
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a characteristic common to all psychotics. However, in the preliminary 
study to the present investigation (Payne, Matussek and George, 1959), 
the schizophrenic group had a mean overinclusion score of 56:93 on 
this test, the neurotics obtaining a mean of only 16-64, very close to 
Epstein and Payne and Hirst’s normal groups. These findings are rather 
puzzling. On the face of it, this test should be a good measure of over- 
inclusive thinking. Why the depressives should have obtained such high 
scores in Payne and Hirst’s study, and why Epstein’s original schizo- 
phrenic group should have obtained such low scores, it is difficult to 
understand, as on all the measures of overinclusion so far discussed in 
the present study, depressives obtained normal scores, and only schizo- 
phrenics were overinclusive. It was hoped that the present study might 
resolve some of these discrepancies. 

The overinclusion score used in the present study is identical with 
that used in previous studies. It is based on those words which tended 
not to be underlined by normal people in Epstein’s initial study, and 
was scored according to Epstein’s procedure. In order to normalize the 
distributions, the log transformation shown in Table 1.54 was applied 
to the raw scores. 


TABLE 1.54 
Transformation applied to Epstein Test, overinclusion score (Variable 20) 
Transformed 
Raw Score Score 
3-4 0 
5-6 1 
7-8 2 
9-12 3 
13-17 4 
18-25 5 
26-36 6 
37-51 7 
52-73 8 
74-105 9 


The results are presented in Table 1.55. 


TABLE 1.55 
Epstein Test overinclusion score: transformed (Variable 20) 


Normals  Dysthymics  Hysterics — Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 3:05 3:92 3-25 4-25 4-45 
Variance 4:37 6-63 4-79 651 10-05 
Significance F = 0:97, not significant 
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This is one of the few measures used in the present study which pro- 
duced insignificant differences between the groups. The normals obtained 
a mean score similar to those found in previous studies (approximately 
14 in raw score units). The neurotics obtained a very similar mean 
score (18) to those tested by Payne, Matussek and George. However, 
the depressives obtained a very much lower mean score than the group 
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FIG. 1.5. Epstein Overinclusion Score. 
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tested by Payne and Hirst (1957): approximately 22 in the present study, 
as contrasted with 38 in Payne and Hirst's study. The schizophrenics in 
the present study obtain a mean of approximately 32, closer to Epstein's 
original result, and markedly divergent from the mean of 57 found by 
Payne, Matussek and George. The present study throws very little light 
on the reason for these discrepancies. All the groups discussed were of 
similar (average) intellectual level, so that differences in intelligence can- 
Dot have produced them. The most striking feature in the present study 
Is the extremely large variance in the schizophrenic group (see Fig. 1.5). 
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This was one factor contributing to the lack of significance between the 
groups, and could explain why small groups of schizophrenics could 
vary unduly in their mean scores from sample to sample. Despite these 
contradictions, the means obtained in the present study are at least in 
the order expected from the theory of overinclusion. 

The Epstein test also includes neologisms. Five of the fifty stimulus 
words are neologisms as are five of the response words. Epstein did not 
investigate the responses to these neologisms separately. In the present 
study a ‘neologism score’ was calculated, which was merely the total 
number of words (other than the word ‘none’) underlined in response 
to a neologism, plus the number of neologisms underlined as responses. 
It was thought that this score might also be a measure of overinclusion, 
since a high score would suggest an individual whose concepts can in- 
corporate words which are meaningless, but which have features in 
common with everyday words. The neologism score was transformed 
linearly for ease of computation as shown in Table 1.56. 


TABLE 1.56 
Transformations applied to the Epstein Test neologism score (Variable 21) 
Transformed 
Raw Score Score 
0-2 0 
3-5 1 
6-8 2 
9-11 3 
12-14 4 


The results for this score are given in Table 1.57. 


TABLE 1.57 
Epstein Test neologism scores: transformed (Variable 21) 


Normals  Dysthymics Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 0-25 0:33 0-13 0:20 0-80 
Variance 0:30 0:42 0-12 0-17 1-64 
Significance F = 2:08, not significant 


Although this score produced differences in the expected direction, 
they are not statistically significant. Payne, Matussek and George (1959) 
found significant differences on this score between schizophrenics and 
neurotics, while Payne and Hirst (1957) found that endogenous depres- 
sives did not differ significantly from normals. Thus, in spite of the 
insignificant results obtained in the present study, the evidence overall 
still suggests that the neologism score is a measure of overinclusion, 
although it may not be particularly reliable. 
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If schizophrenics are overinclusive, they should find the Epstein Test 
more complicated than normal people. Their concepts should have a 
greater number of aspects, all of which would have to be considered. 
They should thus take longer to do the test. However, depressives should 
also spend longer at the test because of their retardation. The log 
transformation applied to the total time score to normalize the distri- 
butions is shown in Table 1.58. 


TABLE 1.58 
Transformation applied to Epstein Test total time (Variable 19) 
Total Time Transformed 
(seconds) Score 
407-510 0 
511-639 1 
640-801 2 
802-1004 3 
1005-1258 4 
1259-1577 5 
1578-1977 6 
1978-2477 7 
2478-3104 8 
3105-3890 9 
The results for this score are given in Table 1.59. 


TABLE 1.59 
Epstein Test total time: transformed scores (Variable 19) 


Normals Dysthymics Hysterics — Depressives Schizophrenics 


Number of cases — 20 11 6 17 18 
Mean 2:55 2-45 1-83 4-06 5-06 
Variance 1-42 2:87 337 431 5:82 
Significance F = 665, p < 0:01 


These results are as expected. . 

(6) Luchins’ Test of ‘Einstellung’ Rigidity. Yt was hypothesized that 
‘overinclusion’ is due to an abnormal degree of 'stimulus generalization" 
in Schizophrenia. This should make discrimination learning more diffi- 
cult. ‘Set’ of the ‘Einstellung’ type can be regarded as a special case of 
discrimination learning. For example, in Luchins’ (1942) ‘water jar’ 
test, a typical test of ‘Einstellung’ rigidity, in the course of a series of 
problems, a specific type of complicated solution is consistently ‘rein- 
forced’, simple methods not being found to work. Later, when this 
‘set? has been learned (discrimination learning), it is found that even 
when a simpler type of solution is possible, normal people continue to 
use the complex method due to their ‘set’. If the theory is correct 
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schizophrenic patients should not readily be able to learn a ‘set’ of this 
type. They should not be susceptible to the development of ‘Einstellung’ 
rigidity. 

fa order to test this hypothesis in the present experiment use was 
made of a modified form of Luchins (1942) Test based on a version by 
Rokeach (1948). 

The test items were drawn on cards each giving a representation of 
two (or three) containers, with a number in each to indicate its capacity. 
The subjects were shown Card 1, which depicted two containers, a 
thirty-nine gallon, and a four-gallon container. They were asked to 
imagine that they were beside a well, with only these two containers, 
and were asked how they could measure out precisely thirty-one-gallons 
of water. If they were unable to solve this trial problem, it was explained 
to them that this could be done by first filling the thirty-nine-gallon 
container, and then emptying out four gallons into the four-gallon con- 
tainer twice. The subjects were then given the ‘control problem’. This 
problem presented three containers, a thirteen, a twenty-nine and a 
three-gallon container. The subjects were asked to measure out ten 
gallons. The ‘control’ problem can be solved in two ways. If the con- 
tainers are labelled ‘A’, *B' and ‘C’, it can be solved by the method 
A-C (the ‘simple’ method) or by the method B-A-2C (the ‘difficult’ 
method). The ‘difficult’ method is the method which was subsequently 
reinforced, while the ‘set’ was being established. The purpose of the 
‘control problem’ was to demonstrate that all the subjects used the 
‘simple’ method before the set was established. Following the adminis- 
tration of the ‘control problem’, another item was demonstrated, con- 
sisting of a thirty, a forty and a three-gallon container. The subjects 
were asked to get four gallons. It was explained that this problem was 
more difficult and could only be solved in one way. The ‘complicated’ 
method (B-A-2C) was then demonstrated. The subjects were then given 
a series of five ‘set’ problems, to solve by themselves. Each problem 


TABLE 1.60 
Einstellung Test ‘set’ problems 

A B c 
21 127 3 get 100 
20 59 4 get 31 
18 43 10 get 5 
31 61 4 get 22 
14 59 10 get 25 


could only be solved by the ‘difficult’ method (B-A-2C), the purpose 

being to establish this set. (Rokeach used only two ‘set? problems. The 

number was increased to five in the present study, as it was hoped that 

all normal subjects would be made ‘rigid’ by this procedure, and only 
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the schizophrenics would tend to be ‘flexible’ at the end of the test.) 
The five ‘set? problems are shown in Table 1.60. 

The subjects were then given five ‘critical’ problems. These followed 
the ‘set’ problems without a pause. Each of the five ‘critical’ problems 
can be solved, either by the ‘simple’ (A-C) method, using only two 
jars, or the ‘difficult? (B-A-2C) method. These ‘critical’ problems are 
shown in Table 1.61. 


TABLE 1.61 
Einstellung Test ‘critical’ problems 

A B Cc 
23 49 3 get 20 
11 25 3 get 8 
17 40 6 get 11 
10 23 3 get 7 
11 27 5 get 6 


Following the ‘set’, it was expected that nearly all the subjects would 
use the ‘difficult’ method. Only those people who do not readily develop 
a ‘set’ of this sort, would use the ‘simple’ method. The score used was 
merely the number of ‘simple’ solutions used for the last five ‘critical’ 
Problems. It was predicted that schizophrenics, being overinclusive, 
would get higher scores than the other groups, as they would not have 
developed the ‘set’. 

Unfortunately, a direct comparison of the groups on this measure 
Was to some extent invalidated by the results obtained from the ‘control’ 
problem. It was assumed that, before the ‘set’, all the subjects would 
use the ‘simple’ method. This was not the case. Five of the twenty nor- 
mals, two of the twenty neurotics, five of the sixteen depressives and 
four of the eighteen schizophrenics who were given the test used the 
difficult? method of solution on the ‘control’ item. Although these 
differences are not statistically significant, the groups were not exactly 
equal in this respect at the beginning of the test. (Note that some sub- 
po <i not be given this test because of their defective arithmetic 

ility. 


TABLE 1.62 


Luchins "Einstellung" Rigidity Test. Number of simple solutions given 
following set (Variable 16) 


Normals  Dysthymics —Hysterics Depressives Schizophrenics 


Number of cases — 20 12 8 16 18 
Mean 1-10 133 1-00 1-00 1:89 
Variance 3-99 2-97 3-71 3:47 3.99 
Significance F = 0-63, not significant 
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The results obtained from the ‘critical’ items are shown in Fig. 1.6. 
They are also presented in Table 1.62. 

The mean scores obtained from this measure are exactly in the pre- 
dicted direction. All the groups except the schizophrenics obtain almost 
jdentical mean scores, the schizophrenics obtaining more simple solu- 
tions as expected. However, these differences fail to achieve the 5% 
level of significance, using an analysis of variance procedure. As can be 


c n*20 


D LED 


n *18 


o ' 2 3 L 5 


FIG. 1.6. Luchins' Modified Water Jar Test. Number of simple 
solutions following ‘set’. 


seen fromthe histograms in Fig. 1.6, the distribution for the schizophrenic 
group tends to be ‘U’ shaped. No transformation would normalize these 
distributions, and the analysis of variance used is not, strictly speaking, 
an appropriate test of significance for this score. If we arbitrarily call 
0 or 1 ‘simple’ solutions a ‘rigid’ response to the test, and two or more 
‘simple’ solutions a ‘flexible’ response to the test, it is possible to perform 
a 7? test on these data. If the schizophrenics (the ‘overinclusive’ group) 
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are compared with a combination of all the other groups (a ‘non- 
overinclusive’ group), we find that the schizophrenics differ significantly 
at the 5% level on a ‘one tail’ test (y? = 3-00) in the direction of being 
more flexible. These results are regarded as supporting the theory espe- 
cially in view of the fact that the Luchins Test is probably not very reli- 
able. It is interesting that, on the above classification, half the schizo- 
phrenics are ‘flexible’ after the ‘set’, while only fifteen out of fifty-six 
of the remaining subjects are ‘flexible’ after the ‘set’. 

(7) Measures Derived from the Leiter-Partington ‘Pathways’ Test. It 
is hypothesized that ‘overinclusive’ subjects (schizophrenics) are unable 
to develop a ‘set’. On the Luchins Test and on the sorting tests, this is 
an advantage in that ‘overinclusive’ subjects are not blind to possible 
solutions which are missed by normal people. They are less rigid and 
more ‘original’. However, this lack of ‘rigidity’ or ‘set’ should also be 
a disadvantage in some situations. 

In the Leiter-Partington (1950) Pathways Test, the subject is presented 
with a series of circles randomly drawn on a piece of paper, and ran- 
domly numbered from 1 to 10. Without giving verbal instructions, the 
examiner with his pencil connects the circles in numerical order. The 
subject is then told ‘do as I did’ and presented with an identical sheet 
of paper. If he succeeds (as nearly all subjects do), he is told “do as I 
did’ with a second sheet of paper (the test proper) containing many more 
numbered circles in a different arrangement. The authors of the test 
found that normal people almost never make errors, and indeed they 
use it as an error-free speed test. In the second part of the test, the 
material and procedure are identical, except that the circles have to be 
connected in the order 1, A, 2, B, etc., numbers and letters alternating. 

The success of this test with normal people almost certainly depends 
upon the fact that when they perceive numbers being connected in order 
(or numbers and letters being connected), they immediately assume that 
the essence of the test is to connect the circles in numerical (or numerical- 
alphabetical) order. Presumably this strong ‘numerical set’ has been 
built up through past experience. They ignore the spatial juxtaposition 
of the circles on the paper, and the direction of the movements the 
examiner makes, judging these to be irrelevant. However, if schizo- 
phrenics do not adopt this normal ‘set’, and are unable to 'screen out 
these irrelevant aspects, they might well take them into account, and, 
therefore, elaborate some much more complex hypothesis about what 
the examiner is doing, involving all these aspects. However, when they 
àre given the proper test sheet, they find that it includes many more 
items, and that the spatial juxtaposition of the numbers and circles is 
Now quite different. Any complex hypothesis involving both numerical 
arrangement and spatial arrangement would no longer produce a solu- 
tion, since these aspects would now be in conflict. Indeed, anything 
more elaborate than a simple numerical sequence would not produce a 
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solution. It was predicted that, if schizophrenics do form over-elaborate 
hypotheses for these reasons, they should find this test much more 
difficult than normal people. They should take longer to do it. Further- 
more, the conflicts produced by an elaborate hypothesis which no longer 
‘fits’, should lead to errors. 

The subjects were given the two Leiter-Partington Pathway Tests ac- 
cording to the procedure described in the manual (Leiter and Partington, 
1950). In addition to timing the performance on each pathway in 
seconds, an error score was calculated. An error was scored for each 
circle left unconnected, and each circle connected out of proper sequence. 
Occasionally in pathway II, a single error (e.g. an omission) could result 
in a reversed sequence throughout the rest of the pathway (e.g. 
B-2-C-3-D-4, etc., instead of 2-B-3-C-4-D, etc.). When this hap- 
pened, only a single error was scored. If it happened a second time, 
however, each deviation from the correct sequence for the entire path- 
way was scored as an error. 

The cube root transformation which was applied to the error score 


for Pathway I in order to normalize the distribution is shown in 
Table 1.63. 


TABLE 1.63 
Transformation applied to number of errors in Pathway I (Variable 62) 
Transformed 
Errors Score 
0 0 
1 1 
2-8 2 
9-27 3 


The error score on Pathway II was transformed linearly for ease of 
computation as shown in Table 1.64. 


_ results for the two error scores are presented in Tables 1.65 and 


TABLE 1.64 
Transformation applied to number of errors in Pathway II (Variable 64) 
Transformed 

Errors Score 

0-3 0 

47 1 

8-11 2 
12-15 3 
16-19 4 
20-23 5 
24-27 6 
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TABLE 1.65 
Number of errors in Pathway I: transformed scores (Variable 62) 


Normals  Dysthymics Hysterics — Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 0-05 0 0 0:40 0-25 
Variance 005, 0 0 0:78 0:51 
Significance F — 1:40, not significant 

TABLE 1.66 


Number of errors in Pathway II: transformed scores (Variable 64) 


Normals  Dysthymics —Hysterics — Depressives Schizophrenics 


Number of cases — 20 12 8 20 20 
Mean 0:35 0:33 0:38 0:95 1-40 
Variance 1:40 0:79 0:55 3:10 5:13 
Significance F = 1-46, not significant 


Virtually none of the subjects made any errors at all on Pathway I, 
and the groups are not significantly different on this test. A similar 
result was obtained in the preliminary study (Payne, Matussek and 
George, 1959). It would appear that the task is so simple that, even if 
the schizophrenics do attend to irrelevant aspects of the demonstration 
pathway, they are easily able to perform the test proper. It is possible 
that, even if they had formulated the wrong hypothesis to account for 
the behaviour of the examiner on the demonstration item, the test is so 


TABLE 1.67 


Transformations applied to time scores on the Pathways Test 
(Variables 61 and 63) 


Pathway I Transformed Pathway II Transformed 
Time (seconds) Score Time (seconds) Score 
27-32 0 19-27 0 
33-40 1 28-39 1 
41-48 2 40-54 2 
49-59 3 55-76 3 
60-72 4 71-107 4 
73-88 5 108-149 5 
89-107 6 150-209 6 
108-131 7 210-293 7 
132-159 8 294-411 8 
160-194 9 412-575 9 


Simple that, as soon as they find themselves in ‘conflict’ they are able 


to perceive the simpler explanation. , 
The error score for Pathway II produced mean differences in the 
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expected direction, but these are not significant. In the preliminary 
study, this score had differentiated schizophrenics from neurotics at the 
5% level. Again, it is possible that, were a more difficult test constructed 
with a similar rationale, it would differentiate the groups better. 

If the pathways test is more difficult for overinclusive subjects, they 
should take longer to do it. However, general retardation should also 
lead to slowness on this measure. The time scores were thus not regarded 
as pure measures of overinclusion. In order to normalize the distribu- 
tions, the log transformations shown in Table 1.67 were applied. 

The results are presented in Tables 1.68 and 1.69. 


TABLE 1.68 
Time taken on Pathway I: transformed scores (Variable 61) 


Normals  Dysthymics — Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 3:55 445 413 4:80 4:68 
Variance 2-68 0:67 1:27 4:17 612 
Significance F = 1-40, not significant 

TABLE 1.69 


Time taken on Pathway II: transformed scores (Variable 63) 


Normals  Dysthymics —Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 3.85 4:33 425 5:05 4:84 
Variance 2:56 2-06 1:36 3-10 4:66 
Significance F = 1-42, not significant 


These scores produce mean differences more or less as predicted. 
The depressives are slowest on both tests. On neither measure are the 
differences significant in the present study, although they significantly 
differentiated neurotics from schizophrenics in the preliminary study, 
(Payne, Matussek and George, 1959). 

(8) The Drawing Design Test. The ‘drawing design’ test was deve- 
loped by Beech (1956) for an investigation with brain damaged patients. 
It consists of a series of nine cards containing designs of the type used 
in a typical ‘block design’ test. The designs are placed on the table 
(which is covered with a standard background material) in front of the 
subject, eighteen inches from the edge of the table. He is asked to copy 
them in pencil in a small book, pinned at the edge of the table imme- 
diately in front of him. A fresh page is used for each fresh design. In 
order to investigate the accuracy of these reproductions, Beech deve- 
loped a ‘disproportionality’ score. This score consists of the average of 
a number of measurements taken from each drawing, which measure 
the extent to which the proportions are incorrect. Beech found, testing 
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a group of normals, neurotics, depressives, brain damaged patients and 
schizophrenics, that although they worked slowly, the schizophrenics 
produced significantly more accurate drawings than the other groups. 

It was thought that this measure could conceivably be related to over- 
inclusion. When normal people are asked to copy a drawing in this 
situation, they may have the ‘set’ that what the examiner requires is 
only a reasonable facsimile. They therefore spend little time, and repro- 
duce merely the salient features of the drawing. They disregard the 
detailed proportions as not being relevant in this context. Schizo- 
phrenics, on the other hand, being overinclusive, may not have this 
‘set’. When asked to copy the drawings, they may take this instruction 
quite literally, to include reproducing not only the salient features, but 
also all the details, as accurately as possible. Thus their work is slower 
but more accurate. 

The disproportionality score was calculated according to Beech’s 
(1956) method. The raw scores were transformed linearly for ease of 
Computation as shown in Table 1.70. 


TABLE 1.70 
Transformation used for Drawing Design Test disproportionality score 
(Variable 54) 

Transformed 
Score Score 
1:09-1-16 0 
1-17-1-23 1 
1-24-1-30 2 
1:31-1:37 3 
1:38-1:44 4 
1:45-1:51 5 
1:52-1:58 6 
1:59-1:65 T 
1:66-1:72 8 
1:73-1:80 9 


The results are presented in Table 1.71. 


TABLE 1.71 
Drawing Design Test disproportionality: transformed scores (Variable 54) 


Normals Dysthymics Hysterics Depressives Schizophrenics 


Number of cases — 20 12 8 20 20 
Mean 3-10 2-67 3-25 3-40 2-50 
Variance 1:88 2:24 4:79 3:73 6:05 
Significance F = 0:68, not significant 


Although schizophrenics have the lowest mean score (i.e. are most 
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accurate), the differences are not significant. Since it has already been 
shown that the schizophrenics are overinclusive on a large number of 
tests, it appears doubtful whether unusual accuracy in this drawing 
test is associated with ‘overinclusion’, as expected. Clearly many other 
factors must influence performance on a test of this sort. 

The length of time spent on each drawing was also measured. If 
schizophrenics are indeed more careful in observing details, they should 
be unusually slow. Depressives also should be slow, however, if they 
are generally retarded. The average time spent on each drawing was 
transformed logarithmically in order to normalize the distributions as 
shown in Table 1.72. 


TABLE 1.72 
Transformation used for average time, Drawing Design Test (Variable 55) 
Transformed 

Time (seconds) Score 
3-98-5-21 0 
5:22-6:82 1 
6:83-8:93 2 
8:94-11:70 3 
11:71-15:31 4 
15:32-20:05 5 
20:06-26:24 6 
26:25-34:36 7 
34:37-44-98 8 
44:99-58-84 9 


The results are presented in Table 1.73. 


TABLE 1.73 
Drawing Design Test average time: transformed scores (Variable 55) 


Normals  Dysthymics —Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 3:30 4:50 475 5:30 5:55 
Variance 3-06 1-73 3-93 2:33 342 
Significance F= 5:38, p < 0-01 


In spite of the fact that the schizophrenics are not significantly more 
accurate, they are significantly slower on this test. The depressives, as 
expected, also tend to be slow. This result is very similar to those ob- 
tained from the Babcock motor tests, on which the schizophrenics were 
slow, even though their slowness could not reasonably be attributed to 
‘overinclusive’ thinking. Again this result suggests that schizophrenics 
might be generally retarded quite apart from their abnormal over- 
inclusion. 
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(9) Inductive vs. Deductive Reasoning Tests. It was thought likely 
that overinclusive thinking would affect performance on some standard 
cognitive tests. It would be most likely to affect tests which require the 
subject to perform inductive reasoning, since the categories induced 
would tend to be overinclusive, and occasionally distorted for this 
reason. It was thought on a priori grounds that a test likely to be so 
affected would be the Wechsler ‘Picture Arrangement’ Test. In this test, 
a series of pictures are placed in front of the subject in random order. 
The subject must first perform inductive reasoning, in that he must for- 
mulate some story which could account for all the pictures, before he 
can arrange them. It was thought likely that ‘overinclusive’ subjects 
would make up over-elaborate and rather unusual stories, and thus tend 
to obtain poor scores. 

It was thought that a test unlikely to be affected by ‘overinclusive’ 
thinking, would be one requiring the subject to perform deductive 
reasoning, manipulating concepts which have been ‘over-learned’ in the 
Past, and are thus least likely to be distorted. Numerical concepts would 
appear to fulfil this condition. One test of this sort would appear to be 
the Wechsler ‘Arithmetic’ sub-test. 

Both these tests were administered. It was predicted that the schizo- 
Phrenics would not be differentiated from the other groups by the 
Arithmetic test, but would tend to get low scores on the 'Picture 
Arrangement’ test. The tests were administered and scored in accord- 
ance with Wechsler's (1944) procedure. The ‘weighted sub-test’ scores 
FEN transformed linearly for ease of computation as shown in Table 

474. 


TABLE 1.74 


Transformations applied to Wechsler Arithmetic and Picture Arrangement 
scores (Variables 52 and 53) 


*Picture 
Arithmetic Arrangement" 
Weighted Transformed Weighted Transformed 
Score Score Score Score 
1-2 0 4-5 0 
3-4 1 6-7 1 
5-6 2 8-9 2 
7-8 3 10-11 3 
9-10 4 12-13 4 
11-12 5 14-15 5 
13-14 6 16-17 6 
15-16 7 18-19 7 
17-18 8 


The results obtained with these two tests are presented in Tables 1.75 
and 1.76. 
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TABLE 1.75 
Wechsler Bellevue Arithmetic Test: transformed scores (Variable 52) 


Normals  Dysthymics  Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 19 
Mean 5-00 4-50 5-13 3-60 4:37 
Variance 5-47 5:36 1-84 | 362 8:25 
Significance F = 148, not significant 

TABLE 1.76 


Wechsler Bellevue Picture Arrangement Test: transformed scores (Variable 53) 
Normals  Dysthymics Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 3-10 2:50 3-13 2:15 2:25 
Variance 2:31 1:91 2:98 2:03 1:46 
Significance F = 1-69, not significant 


The differences on neither test were significant, and these predictions 
cannot be said to have been borne out. 


(e) Tests of ‘Psychoticism’ 

In order to investigate the relationship between the tests of thought dis- 
order so far discussed, and ‘psychoticism’, a number of tests which are 
thought to measure this factor were included. The tests selected were 
those which differentiated best between a mixed psychotic group and a 
neurotic group in the studies reported by Eysenck, Granger and 
Brengelmann (1957) and Sybil B. G. Eysenck (1955), and were regarded 
by the authors as good tests of ‘psychoticism’. 

One of the best tests of *psychoticism' (Eysenck, Granger and Brengel- 
mann, 1957; Eysenck, 1953c) is the Nufferno Speed Test (A1), which 
has already been discussed. On inspection, a large number of the ‘psycho- 
ticism’ tests could be classified as speed tests, since they measure intel- 
lectual, perceptual or motor speed. Others are more difficult to classify. 
The tests selected are described in the following section. 


Speed Measures of ‘Psychoticism’ 

(1) The Brengelmann ‘Picture Recognition Test’. This test was deve- 
loped by Brengelmann (Eysenck, Granger and Brengelmann, 1957). In 
its original form, it consisted of a group of three-dimensional objects 
viewed monocularly through a camera shutter. The purpose of the test 
was to discover at what shutter speed the subjects were able to recognize 
the objects. In the study reported by Eysenck, Granger and Brengel- 
mann (1957), psychotics needed significantly longer shutter speeds than 
neurotics or normals, before they could identify the objects. The version 
of the test used in the present study uses pictures, rather than real objects. 
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This form of the test was developed by Brengelmann (1953). Pinillos 
and Brengelmann (1953) and Brengelmann (1958), obtained results simi- 
lar to those found with the three-dimensional form of the test. 

The test used in the present study consists of six pictures (see Plate IV) 
which were presented to the subjects tachistoscopically in a viewing box 
one at a time. The internal measurements of the viewing box were | ft. 
lin. x 84in. x 84 in. The stimulus card was slid in at the back of the 
box at a fixed distance of 1 ft. 24 in. from the eye position (the photo- 
graphic shutter of a 3-5 epsilon aperture). The inside of the box was 
painted matt off white and two 100-watt bulbs were attached inside the 
front of the box above and below the aperture. 

The subjects were told: ‘I am going to place a picture in this box. I 
want you to look at the picture through this shutter, and tell me what 
it is. You can operate the shutter here yourself by this cable. Go ahead 
when you are ready and tell me briefly what you see—for example you 
might see an aeroplane, a bowl of flowers, or perhaps a man with his 
dog. You will get a number of tries, but if you are not sure, you can have 
a guess.’ For pictures 3 and 6, the subject was told the number of objects 
that were to be identified on the card. After each response the subject 
was told whether he was correct or not. 

The acceptable answers for the pictures were (1) face, negro head or 
head; (2) railway junction or railway lines; (3) face, wool and fingers 
or hand—all three were required; (4) hands; (5) spectacles or sunglasses ; 
(6) person on chair—both objects were required. 

The subjects were given a standard series of eleven exposures for each 


TABLE 1.77 
Brengelmann Picture Recognition Test exposure times 

Trial Exposure Time 

1 15g second 

2 aby second 

3 4 second 

4 1 second 

5 3 second 

6 3 second 

7 1 second 

8 1 second 

9 3 seconds 
10 3 seconds 
11 30 seconds 


card, the exposure times getting progressively longer. The series of 
exposures was continued for each card until the subject obtained the 
correct answer. The exposure times were as shown in Table 17% 
The score for each card was merely the number of exposures necessary 
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to identify the picture. If the subject failed to identify the picture 
even with the thirty-second exposure (trial 11), a score of 12 was 
given. The total score for the test was the sum of the scores for all six 
pictures. . 

For ease of computation, this score was transformed as shown in 
Table 1.78. 


TABLE 1.78 


Transformation used for Brengelmann Picture Recognition Test scores 
(Variable 10) 


Sum of Transformed 
Exposures Score 
21-25 0 
26-30 if 
31-35 2 
36-40 3 
41-45 4 
46-50 5 
51-55 6 
56-60 7 
61-65 8 


The results obtained for this score are presented in Table 1.79. 


TABLE 1.79 
Brengelmann Picture Recognition Test: transformed scores (Variable 10) 


Normals  Dysthymics  Hysterics — Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 2-00 2:08 3:13 415 3-40 
Variance 1:16 1:54 1:27 1:61 4:25 
Significance F = 7:06, p < 0:01 


The results from this test are very similar to those obtained with the 
intellectual (Nufferno tests) and motor (Babcock tests) speed tests. The 
depressives are slowest, followed by the schizophrenics. Again, this is 
consistent with the hypothesis that depressives are generally retarded. 
However, as with the results obtained from the motor tests, it is difficult 
to see how abnormal ‘overinclusion’ could produce so much slowness 
on this test in the schizophrenic group. 

(2) The ‘Three Squares’ Test. This test is extremely simple. The sub- 
jects are given a plain sheet of paper, and merely instructed to draw 
three squares with a pencil. The score is merely the time required. This 
would appear to be a motor speed test, very similar to many of Babcock 
and Levy's (1940) simple motor speed tests. Eysenck (19522) and Sybil 
Eysenck (1955) found it to be a good test of ‘psychoticism’. 
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In order to normalize the distributions, the scores were transformed 
logarithmically as shown in Table 1.80. 


TABLE 1.80 
Transformation applied to ‘Three Squares’ test scores (Variable 51) 
Transformed 
Time (seconds) Score 
3-4 0 
5-6 1 
7-10 2 
11-16 3 
17-25 4 
26-38 5 
39-59 6 
60-90 7 
91-139 8 
140-213 9 
The results obtained are presented in Table 1.81 


TABLE 1.81 
Time to Draw Three Squares: transformed scores (Variable 51) 


Normals  Dysthymics Hysterics — Depressives Schizophrenics 


Number of cases — 20 12 7 19 20 
Mean 1-10 2:25 2-43 2:84 325 
Variance 0:73 1:48 0:95 1:36 2:83 
Significance F = 8:37, p < 001 


The results on this test are similar to those obtained with the other 
motor speed tests, excepting that it is the schizophrenics rather than 
the depressives who are slowest. This result is similar to that obtained 
with the drawing design test time scores. It is conceivable that the 
schizophrenics tend to be over-meticulous. 

(3) The USES Manual and Finger Dexterity Tests. These tests are 
both from the United States Employment Service (1955) General Apti- 
tude Test Battery. They were used as a measure of ‘Psychoticism’ by 
Sybil Eysenck (1955) who found that a combined score differentiated 
a psychotic group from a normal and neurotic group. . 

The Manual Dexterity test consists of a large board, in which large 
Wooden pegs are inserted in holes. The task is in two parts. In the first 
Section, the subject is required to move the pegs two at a time from one 
Part of the board to another, using both hands. In the second part the 
Subject is required to lift the pegs out of the holes with one hand, turn 
them over, and put them back again. The score is the number of pegs 
moved within the time limit for each section. 
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The Finger Dexterity test consists of a small board, containing rows 
of holes. In the holes on the upper part of the board are small rivets. 
On a vertical rod at the side of the board are a pile of washers. The sub- 
ject is first required to move the rivets from the upper part of the board 
one at a time, placing a washer on each rivet, and then to insert the 
assembled rivet and washer into the corresponding hole on the lower 
part of the board. In the second part of the test, the subject is required to 
disassemble the washers and rivets one at a time, placing the washers 
back on the rod, and moving the rivets back to the top part of the 
board. The score is the number of rivets moved in each section within 
the time limit. 

The tests were administered according to the instructions laid down in 
the manual. Following Sybil Eysenck (1955), a combined score was 
obtained from these tests. It was the sum of the scores obtained on the 
manual dexterity task (over three trials for each operation), plus the 
scores for assembling and disassembling on the Finger Dexterity test. 
That is, it is the sum of all the “raw” scores for both tests. 

The scores were transformed linearly for ease of computation as 
shown in Table 1.82. 


TABLE 1.82 


Transformation applied to Combined Manual and Finger Dexterity scores 
(Variable 56) 
Transformed 
Score Score 

115-129 0 
130-143 
144-157 
158-171 
172-185 
186-199 
200-213 
214-227 
228-241 
242-255 


000-10 tA IR O20 


The results are presented in Table 1.83. 


TABLE 1.83 


USES Manual and Finger Dexterity. Combined transformed scores 
(Variable 56) . 


Normals Dysthymics — Hysterics Depressives Schizophrenics 


Number of cases — 20 12 8 20 20 
Mean 733 6:25 6:38 4-75 5:85 
Variance 1-61 1:66 1-12 4:30 4:56 
Significance F = 500, p < 0:01 
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These results are similar to those obtained with the other motor speed 
tests. The depressives are slowest, followed by the schizophrenics. 


Miscellaneous Measures of ‘Psychoticism’ 


(1) The ‘Waves’ Test. This test of ‘psychoticism’ was used by Eysenck 
(1952a) and Sybil Eysenck (1955), who found that it differentiated a 
psychotic from a normal and a neurotic group. This test might be des- 
cribed as a measure of éxpressive movement. It is difficult to see why 
this test should be related to any of the other measures used in the 
present battery. 

The subjects were given a sheet of squared paper with a ‘V’ marked 
plainly in each corner. They were instructed to trace over the first ‘V’, 
keeping the hand and arm holding the pencil off the paper. They were 
then instructed to close their eyes and draw six more ‘V’s, still keeping 
their arm and hand off the paper. The subjects were instructed to keep 
their eyes shut until the examiner had covered up the tracings. With the 
paper still in the same position, the subjects were instructed to continue 
with the next ‘V’, tracing over it with their eyes open, as before, and 
then tracing six more ‘V’s with their eyes closed. This was repeated for 
each of the four ‘V’s. 

The scores used were both measures of the size of the *waves' drawn. 
They were (1) the ‘average wavelength’, which was the sum of the wave- 
lengths (in millimetres) for each set, divided by four, and (2) the 'aver- 
age amplitude', which was the sum of the first and last amplitude in 
each set of waves, divided by eight (again measured in millimetres). 
Previous studies had found that psychotics produced much larger waves, 
as measured by both these scores, than did normals or neurotics. 

The raw scores were transformed linearly as shown in Table 1.84 for 
ease of computation. 


TABLE 1.84 
Transformations applied to ‘Waves Test’ raw scores (Variables 26 and 27) 
Average Transformed Average Transformed 
Amplitude Score Wavelength Score 

16:01-18-00 0 40:01-51:00 0 
18:01-20:00 1 51:01-62:00 1 
20:01-22:00 2 62:01-73:00 2 
22-01-24-00 3 73-01-84-00 3 
24:01-26:00 4 84-01-95-00 4 
2601-28-00 5 95-01-106-00 5 
28:01-30:00 6 106-01-117-00 6 
30:01-32:00 7 117-01-128-00 7 
3201-34-00 8 128:01-139-00 8 

139-01-150-00 9 


The results for these two scores are presented in Tables 1.85 and 1.86. 
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This test produced curiously inconsistent results. Only the ‘average 
amplitude’ differentiated the groups significantly, and on this measure 
the depressives and the dysthymics obtained the largest soores. Since 
the dysthymic group contains a high proportion of cases of ‘reactive 
depression’, it might be argued that individuals in a depressive mood, 
tend to draw large waves. However, this order is not repeated for the 


TABLE 1.85 
Waves Test ‘Average Amplitude’: transformed scores (Variable 26) 


Normals Dysthymics —Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 1-70 3-50 2:63 3:95 2:55 
Variance 1:06 227 3-70 412 447 
Significance F = 4:63, p < 0-01 

TABLE 1.86 


Waves Test ‘Average Wavelength’: transformed scores (Variable 27) 


Normals  Dysthymics Hysterics Depressives Schizophrenics 


Number of cases — 20 12 8 20 20 
Mean 3-00 3:92 5:13 4-60 3-45 
Variance 3°79 1:90 8-27 3-41 3:95 
Significance F — 246, not significant 


‘average wavelength’ score. The correlation between these two scores 
was only 0-46, surprisingly low when it is considered that both measures 
are derived from the same set of waves. It is possible, of course, that 
neither measure is very reliable. Whatever the explanation of these 
results, it is clear that, so far as the present study is concerned, there 
is no evidence that either score can be regarded as a measure of 
‘psychoticism’. 

(2) Strength of Grip. Sybil Eysenck (1955) found that strength of 
handgrip, as measured by a hand dynamometer, differentiated between 
a normal, a neurotic, and a psychotic group. Normals were strongest, 
neurotics less strong, and psychotics weakest. This test was, therefore, 
included in the present study as another suggested measure of 
'psychoticism'. 

The subjects were given a small spring steel hand dynamometer, and 
asked to squeeze it as hard as possible with their preferred hand. When 
the subject had no preferred hand, he was asked to squeeze it once 
with both hands, and the higher measurement was recorded. The dyna- 
mometer was marked off in Kgs. of pressure exerted. This score was 
transformed linearly for ease of computation as shown in Table 
1.87, 
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TABLE 1.87 
Transformation applied to Maximum Grip (Variable 28) 
Transformed. 
Score (kgm.) Score 

9-15 0 
16-22 1 
23-29" 2 
30-36 3 
37-43 4 
44-50 5 
51-57 6 
58-64 7 
65-71 8 
72-18 9 


The results for this score are presented in Table 1.88. 


TABLE 1.88 
Maximum Grip: transformed scores (Variable 28) 


Normals  Dysthymics —Hysterics Depressives Schizophrenics 


Number of cases 20 12 8 20 20 
Mean 4:00 3:17 2:00 2:30 345 
Variance 642 415 2:25 4:54 6:05 
Significance F = 2:02, not significant 


In the present study, strength of grip did not differentiate the groups 
significantly, Although the differences were insignificant, it is interest- 
ing that the hysterics had the lowest mean score. This group had the 
highest proportion of women to men: six women and two men. It 
seemed likely that this could account for this result. As a further check 
on the possible ‘contaminating’ effect of slightly unequal sex distribu- 
tions in the groups, the analysis of variance was repeated for the two 
sexes separately. The F ratios obtained were 2-11 for men, and 1-54 for 
women, neither being significant. This failure to repeat Sybil Eysenck’s 
(1955) findings may well be due to the fact that in the present study, the 
Broups were matched for age. This was not the case in Sybil Eysenck's 
research. Her normal group had a mean age of twenty-two, her neurotic 
Broup a mean age of thirty and her psychotic group a mean age of 
thirty-nine. (These groups were matched for sex.) The strength of grip 
Score placed the groups in the reverse order to their mean age, and it 
Seems likely that the results were merely due to these age differences, 
1e. the younger subjects tended to be stronger. There is thus no evidence 
that strength of handgrip is related to ‘psychoticism’. 

(3) Static Ataxia. Sybil Eysenck (1955) found that when she asked her 
Subjects to stand with their feet together and their eyes closed for thirty 
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seconds, the psychotics swayed more than the other groups. Two meas- 
ures differentiated the groups significantly, the ‘number of reversals’ 
(i.e. the number of separate to and fro movements) and the maximum 
extent of backwards sway. In the present investigation, the subjects 
were asked to take off their shoes, and stand with their feet together 
and their hands by their sides. They were then asked to close their eyes, 
and their movements were measured for a thirty-second period. This 
was done by means of a recording apparatus activated by a string 
clipped on to the subject's collar. Four measures were initially taken: 
(1) The total number of reversals, or changes of direction of movement; 
(2) the total movement—i.e. the distance between the largest backward 
and forward sway; (3) the maximum backward sway from the original 
upright position; and (4) the maximum forward sway from the original 


TABLE 1.89 
Transformation applied to Static Ataxia “Number of Reversals’ (Variable 29) 
Transformed 
Reversals Score 

1-3 0 
4-6 1 

7-9 2 
10-12 3 
13-15 4 
16-18 5 
19-21 6 
22-24 T 
25-27 8 

TABLE 1.90 
Transformation applied to Static Ataxia ‘Total Movement’ score (Variable 30) 
" Total 
ovement 
(in half inches) —À » 

1-2 0 

3-4 1 

5-7 2 

8-10 3 
11-14 4 
15-18 5 
19-23 6 
24-29 9 
30-35 8 
36-42 9 


upright position. Due to a failure of one part of the recording appara- 
tus, the fourth score (maximum forward sway) had to be phe ae 
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One subject, a female ‘hysteric’, fell over backwards during the course 
of the test. She was arbitrarily given the maximum possible backward 
sway score. 

The ‘number of reversals’ score was transformed linearly for ease 
of computation as shown in Table 1.89. 

A square root transformation was applied to the ‘total movement’ 
score to normalize the distributions as shown in Table 1.90. 

A ‘hyperbolic tangent" transformation was applied to the ‘maximum 
backward sway’ score, to normalize the distributions as shown in 
Table 1.91. 


TABLE 1.91 
Transformation applied to Static Ataxia Maximum Backwards Sway score 
(Variable 68) 

Backward 
Sway Transformed 
(in half inches) Score 

[U 0 

1 1 

2 2 

3 3 

4 4 

5 5 

6-7 6 

8-11 7 

12-00 8 


The results for these scores are presented in Tables 1.92, 1.93, and 1.94. 


TABLE 1.92 
Static Ataxia, number of reversals: transformed scores (Variable 29) 


Normals  Dysthymics — Hysterics Depressives Schizophrenics 


Number of cases — 20 12 8 20 20 
Mean 1:65 2:58 3:25 2:25 2:45 
Variance 1:29 2:63 4:50 2:09 1:94 
Significance F = 1-98, not significant 

TABLE 1.93 


Static Ataxia, total movement: transformed scores (Variable 30) 


Normals Dysthymics  Hysterics Depressives Schizophrenics 


Number of cases — 20 12 8 20 20 

Mean 1-95 3-00 3-63 2:60 3:00 
Variance 173 273 742 3-09 232 
Significance F — 1:84, not significant 
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There is no evidence that a measure of swaying, as such, differentiates 
the groups significantly. Neither the ‘number of reversals’ nor the ‘total 
movement’ produced significant differences. In this respect, the present 
results are not consistent with Sybil Eysenck’s (1955) earlier findings. 
However, the maximum backward sway does differentiate the groups 


TABLE 1.94 
Static Ataxia Maximum Backward Sway: transformed scores (Variable 68) 


Normals  Dysthymics Hysterics Depressives Schizophrenics 


Number of cases — 20 12 8 20 20 
Mean 4:05 4:92 3:63 445 6:05 
Variance 4:05 221 8-84 6:79 3:21 
Significance F = 2:96, p < 0:05 


at the 5% level, and it is only the schizophrenics who have extreme 
scores. It is a pity that the measure of forward sway had to be aban- 
doned. The schizophrenics are not significantly different in terms of 
their total movement. Thus, in order to sway further backwards than 
the other groups, they must have swayed less forwards. This finding 
is a very curious one, and it is difficult to see how it might be related to 
the other abnormalities investigated in the present study. It is of interest 
that this was the only test of ‘psychoticism’ which was not merely a test 
of motor, mental or perceptual speed, which differentiated the groups 
significantly in the present study. 

(4) The Word Association Test. The word association test has been 
used as a diagnostic instrument for many years, and a large number of 
studies has been carried out with various clinical groups. It is not pro- 
posed to review this literature here. The word association test used in 
the present study was chosen because Sybil Eysenck (1955) regarded it 
as a good test of ‘psychoticism’. A number of measures differentiated 
between her psychotic and normal and neurotic groups. She found that 
psychotics: (1) more frequently repeated the stimulus word; (2) more 
frequently gave multiple responses; (3) more frequently perseverated 
the same response; (4) more frequently made ‘aside remarks’ and (5) 
more frequently gave unusual associations. Several of these find- 
ings had previously been reported as characterizing schizophrenic 
patients. 

, It was thought that a number of these responses to the word associa- 
tion test might be associated with overinclusion. For example, the ten- 
dency to give more ‘distant’ or ‘unusual’ associations might well be due 
to abnormally inclusive concepts. If words mean more to overinclusive 
subjects, they are likely to have a wider range of associations to them. 
Indeed, Moran (1953) used a word association test as one measure of 
overinclusion, and found that the Schizophrenics who gave more *dis- 
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tant’ associations to a set of words, tended to give the same words an 
‘overinclusive’ definition. The tendency to give multiple responses, and 
aside remarks, and to repeat the stimulus word could conceivably be 
due to the failure to adopt the normal ‘set’ which prevents normal people 
from including such things in their response repertoire, presumably 
because they assume that it is not allowed. 

The thirty stimulus words used in the present study were the same as 
those used by Sybil Eysenck (1955). They were as follows: cold, foot, 
butter, music, red, dream, sweet, soft, sleep, dark, quiet, hungry, anger, 
beautiful, man, loud, long, blue, slow, working, child, heavy, rough, 
memory, house, trouble, religion, hard, deep and wish. Each stimulus 
word was printed on a card. The instructions were as follows: ‘Tam 
going to show you some cards on which you will see a single word 
written. As soon as you see the word I want you to tell me the first 
word that comes to your mind. It does not matter what the word is, 
but it must be the first word you think of, and you should give it as 
quickly as possible.’ The cards were shown one at a time. As soon as 
the subject responded, the next card was presented. 

The following scores were obtained: (1) The number of synonyms 
given as responses; (2) the number of repetitions of the stimulus word; 
(3) the number of multiple response (more than one word); (4) the 
number of aside remarks given, and (5) the number of repetitions of 
previous stimulus or response words given. 


TABLE 1.95 


Transformations used for Word Association Test ‘Number of Synonyms’ and 
Number of Repetitions of Stimulus Words (Variables 42 and 43) 


Number of 
Number of Transformed Repetitions of Transformed 
Synonyms Score Stimulus Word Score 

0-1 0 0-3 0 
2-3 1 4-7 1 
4-5 2 8-11 2 
6-7 3 12-15 3 
8-9 4 16-19 4 
10-11 5 20-23 5 
12-13 6 24-27 6 
28-31 7 


The ‘number of synonyms’ and the ‘number of repetitions of the 
stimulus word’ were transformed linearly for ease of computation as 
Shown in Table 1.95. 

The square root transformation which was applied to the ‘number of 
multiple responses’ in order to normalize the distributions is shown in 
Table 1.96. 
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TABLE 1.96 


Transformation applied to ‘Number of Multiple Responses’ Word Association 
Test (Variable 44) 


Multiple Transformed 
Responses Scores 

0 «0 
1-2 1 
3-6 2 
7-12 3 
13-20 4 
214 5 


The ‘number of aside remarks’ and the ‘number of repetitions of 
previous stimulus or response words’ were transformed linearly for 
ease of computation as shown in Table 1.97. 


TABLE 1.97 


Transformations applied to ‘Aside Remarks’ and ‘Repetitions of previous 
Stimulus or Response’ Word Association Test (Variables 45 and 46) 


Aside Transformed Transformed 
Remarks Scores Repetitions Scores 

0-1 0 0-1 0 
2-3 1 2-3 1 
4-5 2 4-5 2 
6-7 3 6-7 3 
8-9 4 8-9 4 

10-11 5 10-11 5 

12-13 6 12-13 6 

14-15 T 14-15 7 

16-17 8 

18-19 9 


The results for these scores are presented in Tables 1.98, 1.99, 1.100 
1.101 and 1.102. 
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TABLE 1.98 
Word Association Test Number of Synonyms: transformed scores 
(Variable 42) 
Normals Dysthymics Hysterics Depressives Schizophrenics 
Number of cases 20 12 8 19 20 | 
Mean 1-30 1:50 0:75 0-84 1-10 | 
Variance 1:80 1:55 0-79 1:03 2-83 


Significance F — 0771, not significant 
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TABLE 1.99 


Word Association Test Number of Repetitions of Stimulus Word: 
transformed scores (Variable 43) 


Normals  Dysthymics  Hysterics Depressives Schizophrenics 


Number of cases 20 Ta 8 19 20 
Mean 055 5 1-00 2:50 2:47 2:35 
Variance 1:94 491 10:57 7:534. 10:56 
Significance F — 2-13, not significant 

TABLE 1.100 

Word Association Test Number of Multiple Responses: transformed scores 
(Variable 44) 
Normals  Dysthymics  Hysterics Depressives Schizophrenics 

Number of cases 20 12 8 19 20 
Mean 0:65 0-67 0-75 1:37 0:75 
Variance 1:40 0:79 1:93 213 1-78 
Significance F = 1-01, not significant 

TABLE 1.101 

Word Association Test Number of Aside Remarks: transformed scores 
(Variable 45) 
Normals  Dysthymics  Hysterics Depressives Schizophrenics 

Number of cases 20 12 8 19 20 
Mean 0-60 0-67 0:75 1-21 1:30 
Variance 0:99 0:79 2-21 2:51 4:54 
Significance F = 0:79, not significant 

TABLE 1.102 


Word Associaton Test Number of Repetitions of Previous Stimulus or 
Response Words: transformed scores (Variable 46) 


Normals Dysthymics  Hysterics Depressłves Schizophrenics 


Number of cases 20 12 8 19 20 
Mean 1:35 0-75 0:75 1:32 115 
Variance 077 0:57 0:50 2-67 1-61 
Significance F = 0:81, not significant 


None of these measures differentiate the groups significantly, nor are 
the mean positions of the groups in any consistent order. The predic- 
tions for this test are thus unverified, and Sybil Eysenck’s earlier results 
are, for some reason, not repeated in the present study, 


R. W. Payne and J. H. G. Hewlett 


VI. SUMMARY OF THE RESULTS OF THE ANALYSES OF VARIANCE 


(1) In the battery of tests given, there were thirteen measures of 
‘overinclusion’ believed not to be influenced by other factors, such as 
speed. These thirteen measures all produced mean differences in the 
expected direction, in that the schizophrenics always obtained the most 
extreme scores. Five of these measures produced differences between the 
groups significant at beyond the 1% level, and one measure produced 
differences significant at the 5% level, while the remaining seven meas- 
ures did not differentiate the groups significantly. On the whole, these 
findings support the hypothesis that schizophrenics are abnormally 
‘overinclusive’. On none of these measures did the other groups show 
marked differences. There was no evidence that the depressives or the 
two neurotic groups are abnormally overinclusive. This type of thought 
disorder is apparently confined to schizophrenic patients. 

(2) Five measures of ‘concreteness’ were obtained. Three differentiated 
the groups significantly at the 5% level or better. These were the ratings 
derived from the two Goldstein Sorting Tests, and the Benjamin Pro- 
verbs test. Only the schizophrenics were abnormally ‘concrete’ on these 
measures, there being no differences between the other groups. These 
results were not attributed to the schizophrenics’ inability to ‘abstract’, 
but to their tendency to produce unusual responses on these tests, 
which are arbitrarily classified as ‘concrete’ when the traditional method 
of scoring is used. Other sorting tests given suggested that, in terms of 
the capacity to produce ‘usual’ abstract generalizations, when no time 
limit is imposed, there are no differences at all between the groups. 
Neither of the psychotic groups are ‘concrete’. 

(3) Eleven measures of ‘overinclusion’ based on time scores were 
obtained. Schizophrenics were expected to obtain large (slow) scores 
because of their ‘overinclusion’, and depressives because of their general 
retardation. Nearly all these scores differentiated the groups at beyond 
the 1% level of significance. The depressives and the schizophrenics 
tended to be equally slow on these measures. The neurotics tended to 
obtain normal scores. 

(4) Three tests of reasoning speed were included. It was predicted 
that schizophrenics would be slow because of their ‘overinclusion’ and 
depressives because of their retardation. The depressives were slightly 
slower than the schizophrenics, although both groups were consider- 
ably slower than normal. All the tests differentiated the groups at 
beyond the 5% level, and two of these at beyond the 1% level of 
significance. There was no evidence of abnormal slowness among the 
neurotics. 

(5) Seven measures of motor, and perceptual slowness were included. 
It was predicted that the schizophrenics would obtain normal scores on 
these tests, as it was hypothesized that slowness in schizophrenia is a 
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by-product of overinclusive thinking, and would thus be confined to 
intellectual tasks. This prediction was clearly disproven. Both the 
depressives and the schizophrenics tended to be abnormally slow on 
these tests, there being very little difference between them. This finding 
suggests that schizophrenics as a group may suffer from two unrelated 
abnormalities, a general retardation which they share with depressives, 
and overinclusion, which is specific to schizophrenia. There was no 
evidence for abnormal motor or perceptual slowness in the two neurotic 
groups. 

(6) Of the seventeen measures which have been used previously as 
tests of ‘psychoticism’, three are tests of intellectual speed (the Nufferno 
tests), and three are tests of motor or perceptual speed. All six speed 
tests differentiated the groups significantly. The depressives tended to 
be slightly slower than the schizophrenics, both groups being abnormally 
slow. The neurotics had normal scores on. these tests. Of the eleven 
‘miscellaneous’ measures of ‘psychoticism’ used (none of which were 
Speed tests) only one differentiated the groups significantly at the 5% 
level. This was the ‘maximum backward sway’ on the ‘Static Ataxia’ 
test. There appeared to be no obvious relationship between this score 
and the other measures of thought disorder used. 


VII. THE INVESTIGATION OF THE CO-VARIANCE OF THE TESTS 


(a) The Centroid Analysis. Although the tests of *overinclusion' tend 
to differentiate the groups as expected, it was necessary to demonstrate 
that they intercorrelate positively, that they thus measure a common 
factor, and that this factor also differentiates the groups in the way 
predicted from the theory of overinclusion. 

In order to demonstrate this, all the groups were combined, product- 
moment correlations were calculated between all the measures, and the 
resulting matrix of intercorrelations was factor analysed. Before des- 
cribing this procedure in more detail, it is necessary to discuss briefly a 
criticism of this method of analysis which is often made. It has already 
been demonstrated that most of the measures obtained produce signifi- 
cant mean differences between the groups. In other words, the groups 
cannot be regarded as coming from the same population. Since the tests 
are known to differentiate the groups significantly, if the groups are then 
combined and correlation coefficients computed, ‘spuriously high’ cor- 
relations will be obtained. According to this line of argument, the correct 
Procedure would have been to intercorrelate the tests separately for each 
group. This would have given a more valid estimate of the ‘true’ cor- 
Telation between the measures. This point of view was not accepted. It 
Seems equally correct to argue that, had the correlations been computed 
Or each group separately, the correlations obtained would have been 
Spuriously low’, because of the arbitrary restriction of the range of 
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variation of the measures. A very similar problem could arise in an 
experiment in educational psychology. An educational psychologist 
might be concerned with investigating for a ten-year-old school popula- 
tion, the differences to be found between pupils in an ‘A’, a ‘B’ and a 
very retarded ‘stream’. He might administer a group of cognitive tests 
to the pupils, and discover that the three ‘streams’ were very signifi- 
cantly different with respect to their intellectyal level on most of the 
tests. The ‘A stream’ might tend to be very bright, and the ‘retarded 
stream’ very dull. Indeed, this might have been what he had predicted, 
since his initial hypothesis might have been that allocation to a ‘stream’ 
is largely a function of native ability. The psychologist might then wish 
to factor analyse his tests, in order to discover to what extent their 
co-variance might be attributable to a single factor. Very few people 
would argue that he ought not to combine all his ten-year-olds in order 
to investigate his test co-variances. In fact it would be a justifiable 
comment that, if he factor-analysed each ‘stream’ separately, he would 
be unduly restricting the range of variation, and seriously under- 
estimating the co-variance of his tests and the importance of any general 
factor that might emerge. 

This hypothetical example is believed to be relevant in many ways to 
the present investigation. Many of the tests of ‘overinclusion’ used in 
the present study were primarily designed to be measures of an ab- 
normality of function. It was hoped that all normal people would have 
more or less the same score, while only abnormal people would have 
extreme scores. In other words, they were deliberately designed to 
produce no variance in a normal group. It would thus be incorrect to 
investigate their co-variance in a normal group only, or in any other 
single group in which the variance could be expected to be extremely 
limited. The only condition which should be fulfilled, is that the distribu- 
tions on all the tests for the combined group on which the correlations 
are to be obtained should be normal (as should be each bi-variate 
distribution). The main purpose of transforming so many of the scores 
in the present study was to ensure that this would be the case as far as 
possible. 

Not all the measures discussed were included in the factor analysis. 
In some cases, two measures were not mathematically independent of 
each other, and one had to be excluded. Table 1.103 lists the measures 
which were retained for the correlational study. As can be seen by com- 
paring this table with Table 1.2, the following measures were omitted: 
Variable 13 (Total time on the Object Classification Test. This was 
omitted because it is a function of the other time score included), 
Variable 17 (Luchins Test: number of ‘difficult’ solutions following ‘set’. 

This variable was initially coded by mistake. It is merely the inverse of 
the number of ‘simple’ solutions following ‘set’), Variable 36 (Shaw test, 
average response time. This was omitted because it is a function of the 
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other time scores included), Variable 67 (Shaw test, average time per ‘A’ 
response. This was excluded for the same reason as Variable 36). 
Variables 47, 48 and 49, the scores for the three Babcock Motor Speed 
tests (Writing name, writing sentence, writing ‘U.S.A.’) were combined 
on the assumption that they measure the same function, and that a 
combined score would be more reliable. The combined score was merely 
the sum of the transformed scores for the three tests. This variable will 
subsequently be referred to as Variable 47. 

As can be seen from the tables so far given, on several of the tests, 
one or two subjects returned rio score. This is almost inevitable in such 
a large-scale investigation with abnormal people. This was usually due 
to a sudden complete unwillingness to co-operate. In obtaining the cor- 
relations for these few variables, the number of cases was slightly 
reduced. While it is not strictly correct to include any subjects who did 
not produce scores on all the tests, this happened so rarely that it was 
judged not to have any effect on the overall results. 

The present battery of tests was chosen to measure two main aspects 
of thought disorder, ‘Overinclusion’ and ‘General Slowness or Retarda- 
tion’. These two ‘factors’ should be the two major sources of co- 
variation within the matrix. However, a third important source of 
co-variation cannot be ignored. The groups were matched as closely as 
possible for pre-illness general intellectual level, so that no mean differ- 
ences could be attributed to this factor. Nevertheless, in this battery of 
mainly cognitive tests, this is bound to be an important source of co- 
variation. No other sources of co-variation were regarded as important 
in the present study. Scores on the tests of ‘concreteness’ for example, 
were expected to be partly a function of ‘overinclusion’ (because the 
schizophrenics were judged ‘concrete’ on the basis of their overinclusive 
responses) but mainly a function of general intellectual level. The 
various tests of ‘psychoticism’ were thought to be measures of either 
‘slowness’ or ‘overinclusion’, in so far as they measured anything at all 
relevant to the discrimination between the groups. 

Since these three factors were the only important sources of co- 
variation which could be specified on theoretical grounds before the 
factor analysis was initiated, it was only thought justified to extract three 
factors from the matrix. If the theories behind the test selection were 
Correct, it should be possible, by the appropriate rotations, to demon- 
Strate that these three hypothesized factors could be identified clearly, 
and that factor scores derived from them should differentiate the groups 
as predicted. 

The matrix of intercorrelations was factor analysed by means of 
Thurstone’s (1947) method for extracting centroid factors. In view of 
the Subsequent analyses performed, it would have been strictly more 
Correct, to have performed a principal component analysis (see chapter 
by P. Slater). However, with such a large matrix of correlations, the first 
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TABLE 1.103 
Factor Saturations 


Factor 


Unrotated Centroids Rotated Factors 
Variables I Il ul Factor ‘O’ Factor ‘P’ Factor ‘R’ Factor 'G* 
9. Mill Hill Vocabulary ‘IQ’ —0:36 046 | —031  —037 010 004 —054 
10. Brengelmann Picture Recognition Test 0:38 0:05 023 0:13 0:29 0:37 022 
11. Object Classification Test No. of ‘A’ Responses —0:67 0:27 0:03 0:06 —0:25 —0:24 — 0:68 
12. Object Classification Test ‘Non A’ Responses 0-68 0-29 —027 —0:46 0:67 0:54 0:33 
14. Object Classification Test Average Response Time 0:49 0:35 0:04 —0-16 0:59 0:57 012 
15. Obj. Class. Test: Average Time per *A* Response 0-75 0:16 —0:15 —0:32 0:62 0:54 0:46 
16. Luchins Test: No. of Simple Solutions after ‘Set’ —0:22 0:13 —0:07 —0:07 —0:05 —0:08 —024 
18. Luchins Test: Difficult Solutions before ‘Set’ 0-22 0-04 0:39 0:31 0-18 0:32 0:110 
19. Epstein Test: Total Time 0-58 0-38 0:10 —013 0:67 0:67 0:16 
20. Epstein Test: Overinclusion Score 0-56 —0:42 —0:07 —0:03 0:07 0:06 0:70 
21. Epstein Test: Neologism score 0-61 —0-09 —0:32 —0:337 035 ^ 023 0:54 
22. Goldstein Colour-Form *Concreteness' rating 0:71 —0:03 —0:29 —0:38 0:46 0:35 0:57 
23. Goldstein Object Sorting ‘Concreteness’ rating 0-68 —0-11 —0:07 —0-15 0:38 0:336 0:58 
24. Goldstein Object Sorting ‘Overinclusion’ score 0:06 0:36 —0:44 —0:54 0:31 013 —016 
25. Goldstein Object Sorting unusual sortings 0:10 0-20 —0-46 —0:51 022 0:04 —002 
26. Waves Test: Average Amplitude 0:41 0-00 —012 —0:18 0-28 023 031 
27. Waves Test: Average Wavelength 0:30 — 0-07 —0:16 —0:18 0-15 0:10 0:28 
28. Maximum Grip: Dynamometer —0:13 0-34 0:09 —0:00 0:16 017 —033 
29. Static Ataxia: No. of Reversals 0:25 0-14 —021 —0:28 0:27 0-18 011 
30. Static Ataxia: Total Movement 0:29 0-04 —0:20 —0:25 0:23 0:15 021 
31. Shaw Test: No. of ‘A’ Responses —072 0:47 —0-19 —0:20 —0:14 —023 — 0:83 
32. Shaw Test: No. of 'B' Responses —0-08 0-34 —0:40 —0:47 0:20 0:04 —025 
33. Shaw Test: No. of 'C' Responses 0-48 0-06 —0-20 —0:29 0:37 0:29 0:33 
34. Shaw Test: No. of 'D' Responses 052 —0:07 —042 —0:46 0:30 015 047 


35. Shaw "Yest: Total Time 0-16 0-72 —017 —0:42 0-64 0:53 —035 


37. Nufferno Level Test 
38. Level Test Persistence Score 

39. Nufferno Speed Test A1 Unstressed 

40. Nufferno Speed Test BI Unstressed 

41. Nufferno Speed Test A2 Stressed 

42. Word Association Test: No. of Synonyms 
43. Word Assoc. Test: Repetition of Stimulus Word 
44. Word Assoc. Test: Multiple Responses 
45. Word Assoc. Test: Aside Remarks 
46. Word Assoc. Test: Repetition of Previous S or R 
47. Babcock Motor Speed Tests 
50. Babcock 'Substitution' Test 
51. Time to Draw three squares 
52. Wechsler Arithmetic sub-test ` 
53. Wechsler Picture Arrangement sub-test 
54. Drawing Design Test Disproportionality 
55. Drawing Design Test: Average Time 
56. U.S.E.S. Man. and Finger Dexterity 
57. Proverbs Test *Abstract' Score 
58. Proverbs Test: Average No. of Words 
59, Proverbs Test: Average Reaction Time 
60. Proverbs Test: Average Total Time 

61. Pathway I: Time Required 

62. Pathway I: Errors Made 

63. Pathway II: Time Required 

64. Pathway II: Errors Made 

65. Shaw Test: Average ‘A + B' Response Time 
66. Shaw Test: Total Number of Responses 

68. Static Ataxia: Maximum Backwards Sway 


—0:08 
0-72 
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three unrotated factors of a centroid analysis would approximate closely 
to the first three latent vectors obtained from a principal component 
analysis. The factor saturations on these three unrotated centroid 
factors are presented in the first three columns of Table 1.103. 

These three centroid factors, combined, by no means account for all 
the variance in the matrix of intercorrelations. Factor I accounts for 
23-7%, factor II 8:99% and factor III 6:24% of the variance. Jointly, 
39% of the variance is accounted for. This suggests that there are other 
important group factors which have been ignored. However, it was not 
the purpose of the present study to investigate these more restricted 
sources of co-variance, since they were not believed to be associated with 
either of the two major aspects of thought disorder being studied. 

The unrotated centroid factors can, of course, be given no psycho- 
logical meaning. The first factor obviously defines a sort of ‘average’ 
measure of ‘overinclusion’, ‘retardation’ and ‘general intelligence’ com- 
bined. Its composition is determined by the relative number of each type 
of test included, which was completely arbitrary. The method used to 
demonstrate that the three hypothesized factors could be produced by 
the appropriate rotation of these centroid factors will be discussed in a 
later section. 

(b) The Discriminant Function Analysis of the Factor Scores. This 
section of the analysis was devised and carried out by Mr. P. Slater and 
is described in detail in his chapter. It is proposed to discuss mainly its 
psychological implications here. It was assumed that the three centroid 
factors extracted were linear functions of ‘overinclusion’, ‘retardation’ 
and ‘general intelligence’. That is to say, three dimensions representing 
these three variables could equally well define the same three-dimensional 
space as is defined by the first three centroid factors. A major purpose 
of the present study was to discover whether or not neurotics as a group 
suffer from any degree of thought disorder. If they do not, none of these 
three factors should significantly differentiate the neurotics from the 
normal group. That is to say, the neurotic groups’ mean points should 
occupy the same part of the three-dimensional space defined by the 
three centroid factors, as the normal group’s mean point. 

In order to determine this, the two neurotic groups were first com- 
pared with each other. Factor scores on the first three centroid factors 
were calculated for all the groups. (This was done exactly, the regression 
of each test on each factor being taken into account—see chapter by 
Slater.) A discriminant function was then computed for the two neurotic 
groups. This function was by definition, that combination of the three 
factor scores which would maximize the differentiation between the two 
neurotic groups. Using the equivalent of an analysis of variance tech- 
nique (comparing the between groups variance to the within groups 
variance for this discriminant function), the two neurotic groups were 
found not to be significantly different. We can conclude from this, that 
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the two neurotic groups are not significantly different on any of the three 
centroid factors. 

The two neurotic groups were then combined, and a discriminant 
function (of the three factors scores) computed, which would maximize 
the differentiation between the combined neurotic group, and the 
normal group. Again, the difference was insignificant. We must con- 
clude that the neurotic group does not differ significantly from the nor- 
mal group on any of the three centroid factors. This suggests that in 
terms of the test variance accounted for by these three factors, there is 
no evidence of thought disorder in the two neurotic groups. 

The normal and the neurotic groups were then combined to form a 
*non-psychotic? group. The two psychotic groups were also combined. 
A. discriminant function (of the three factor scores) was computed to 
maximize the discrimination between this ‘non-psychotic’ and 'psycho- 
tic' group. This time, the difference was highly significant, the resultant 
variance ratio being 13:86, significant at well beyond the 195 level. In 
other words, these three factors taken in conjunction, differentiate the 
psychotics from the remaining subjects at a high level of significance. 
This demonstrates that one or more of the first centroid factors is 
measuring some function which significantly differentiates the psycho- 
tics, as a group, from the normals and the neurotics. 

Finally, the two psychotic groups were compared with each other. 
A discriminant function (of the three factor scores) was computed to 
maximize the discrimination between the schizophrenic and the depres- 
sive group. Again, the difference was significant, the variance ratio being 
5-88, significant at beyond the 1% level. We can conclude again that 
one or more of the first centroid factors measures some function which 
significantly differentiates the schizophrenics from the depressives. 

„This analysis so far, has not demonstrated that the function which 
differentiates the psychotics from the other groups is not the same 
function which differentiates between the two psychotic groups. In 
other words, it is conceivable that the same dimension is involved, the 
non psychotics’, the schizophrenics and the depressives all lying along a 
line. Such a picture would suggest that the two psychotic groups have 
the same sort of thought disorder, but differ only in the amount. It 
would be consistent with the theory that a single factor of ‘psychoticism’ 
is sufficient to account for the differences between the groups on this 
battery of tests. The next step of the analysis was designed to attempt 
to disprove this hypothesis. 

(c) The Canonical Analysis of the Discriminants. This section of the 
analysis was also devised and carried out by Mr. P. Slater and is 
described in detail in his chapter. Only a summary will be given here. 
The purpose of the analysis was to discover how many uncorrelated 
dimensions are needed to account for the differences between the groups 
On the three factors. Do the five groups all lie along the same dimension 
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(a single factor which might be called ‘psychoticism’), or is more than 
one orthogonal dimension needed to account for the differences between 
the groups? We have already seen that the neurotics and the normals 
are not significantly different, so that we cannot expect to find that more 
than two orthogonal dimensions are needed to account for the differ- 
ences between the ‘non-psychotics’, the schizophrenics and the 
depressives. 

The procedure employed analyses the ‘between groups’ dispersion on 
the three factor scores. A principal component analysis is applied to this 
“between groups’ dispersion to determine how many significant latent 
roots (with associated latent vectors) can be found in it. In other words, 
does all the variation between groups lie along the same line (are all the 
group means in a line in this three-dimensional space?), or are two 
orthogonal dimensions needed to describe the ‘between groups dis- 
persion'? (must two unrelated dimensions be used to describe the 
positions of the group means in this three-dimensional ‘factor score’ 
space?) 

The principal component analysis yielded three latent roots. (More 

than three would have been impossible, since there are only three factor 
scores.) Their values were 0:6995, 0-1763 and 0:0118. This suggests that, 
as far as the groups are concerned, most of the variation between the 
group means lies on one dimension. Its associated latent root (0:6995) 
is significant at beyond the 1% level. However, there is a significant 
amount of variation on a second, orthogonal dimension. Its associated 
latent root (0-1763) is also significant at the 1% level. As expected (since 
the neurotics and normals do not differ significantly), the third dimen- 
sion contains no significant variation. Its associated latent root (0:0118) 
is not significant. 
È We must conclude, then, that a single factor of ‘psychoticism’ is 
insufficient to account for the differences between the groups on the 
three factor scores. Two orthogonal ‘factors’ are required. It will be 
demonstrated later that these two orthogonal dimensions are a function 
of one factor which can be labelled ‘overinclusion’ and a second factor 
which can be labelled ‘retardation’. This analysis, of course, does not 
disprove the existence of the hypothesized third factor of ‘general 
intelligence’. Since the groups were matched in this respect, such a 
factor could not account for differences between the groups, and could 
not have emerged in this analysis. 

(d) The Rotation and Identification of the Factors. Yt was predicted 
from the theory of *overinclusion' that it should be possible to extract 
from the matrix of intercorrelations a factor which could be labelled 
*overinclusion', and that this factor should differentiate the schizo- 
phrenics from the other groups. There are two ways of testing such a 
prediction. The centroid factors could be rotated in such a way that the 
theoretically ‘pure’ measures of ‘overinclusion’ have maximum loadings 
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on one of the factors. This factor could then be labelled one of ‘over- 
inclusion’, and it should significantly differentiate the schizophrenics 
from the other groups. In fact, this method was attempted first. A series 
of rotations was performed, in the hope that such a factor could be 
achieved. This proved almost impossible. With so many variables to 
take into account, it was extremely difficult to imagine what would 
happen to them all, before each rotation was performed. The imaginary 
manipulation of fifty-four variables in a three-dimensional space is not 
easy. Although rotating a factor through a single test, or even a small 
and obvious cluster is simple enough, no method of rotation whereby a 
specified number of tests can be made to have a maximum joint loading 
on a single factor was then known to us. This approach to the problem 
was, therefore, abandoned. 

The alternative method is to obtain that factor which maximizes the 
discrimination between the schizophrenics and the other groups. Accord- 
ing to the theory, this should be a factor of ‘overinclusion’, and the 
theoretically ‘pure’ measures of overinclusion should have the highest 
loadings on the factor if the hypothesis is correct. It was this method 
Which was used. . 

The findings described earlier suggested that both the schizophrenics 
and the depressives were retarded. The schizophrenics, however, were 
also overinclusive. It was thought that this would prove to be an un- 
related abnormality. Therefore, it was predicted that that factor which 
maximized the discrimination between the depressive and schizophrenic 
group, would be a factor of ‘overinclusion’, and that the theoretically 
‘pure’ measures of overinclusion would have the highest loadings on it. 

The technical procedure used was suggested by P. Slater. The factor 
was obtained quite simply, by utilizing the data obtained in the discrim- 
inant function analysis. A discriminant function of the three centroid 
factor scores had been obtained which maximized the differentiation 
between the schizophrenic and the depressive group. Thus, the test 
loadings on the desired factor were obtained merely by pre-multiplying 
the discriminant function weights (for centroid factors I, II and III) 
into the row of factor saturations (for centroid factors I, II and IIT) for 
each test.! The resultant figure gives the saturation of the test on the 
factor which differentiates maximally between the schizophrenics and 
the depressives. This set of saturations is shown in Table 1.103 under 
the column headed ‘Factor O’. This factor was regarded as an ‘over- 
inclusion’ factor. The mean factor scores for this factor are shown in 
Table 1.104. As can be seen, only the Schizophrenics have extreme 
Scores, the other group means being nearly identical. . 

The findings already described, suggested that both psychotic groups 
Were retarded. It was thus predicted that a factor of ‘Psychoticism’ 


! These weights are as follows: factor I= — 0-1718, factor II = — 0:3196, 
factor III = 0-9318. 
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TABLE 1.104 
Mean Factor Scores for the Groups on the *Overinclusion' Factor 
Normals: — 10246 
Dysthymics: — — 11:463 
Hysterics: = 12-761 
Depressives: — — 11-215 


Schizophrenics: — 16:568 


Note: These mean scores were obtained by pre-multiplying the mean scores for each 
group on the three centroid factors (see chapter by Slater for these scores) by the weights 
obtained in the discriminant function analysis for differentiating the schizophrenics from 
the depressive group. 


would in fact be a factor of ‘slowness’ or ‘retardation’. This factor was 
obtained in a similar way. It was defined as that factor which differ- 
entiated maximally between the mean of the two psychotic groups com- 
bined, and the mean of the normal and neurotic groups combined. Again 
from the discriminant function analysis described, a set of weights had 
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FIG. 1.7. The Factors of ‘Retardation’ and ‘Overinclusion’. 
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already been obtained to apply to the three unrotated factors to produce 
this discriminant function.! To obtain the test saturations on this factor, 
these weights were pre-multiplied into each row of test saturations (for 
the three centroid factors). The resulting figure gives the saturation for 
each test on this ‘psychoticism’ factor. The test saturations on this factor 
are given in Table 1.103 under the column headed ‘factor P’. These two 
factors are reproduced geometrically in Fig. 1.7. The ‘overinclusion’ 
factor is marked on the diagram. The ‘psychoticism’ factor is marked 
‘retardation’ and is represented by the heavy black line. These two factors 
(‘Overinclusion’ and ‘Psychoticism’) are not uncorrelated. The angle 
between them is 69° 10’. The correlation between these two factors is 
— 0:484 using the formula: 
TiK (the correlation between two vectors ‘i’ and *K") 


= i'MK where M — the original 


Vi'Mi.K'MK matrix of correlation coefficients. 
Although this ‘psychoticism’ factor differentiates maximally between 
the psychotics as a group, and the combined normal and neurotic group, 
because it is correlated with the ‘Overinclusion’ factor, it also differ- 
entiates between the schizophrenic and the depressive group, the 
schizophrenics being more ‘psychotic’ (see Table 1.105). This factor 


TABLE 1.105 
Mean Factor Scores for the Groups on the ‘Psychoticism’ Factor 


Normals: 16:905 

Dysthymics: — 21:774 

Hysterics: 20-900 

Depressives: — 28:268 

Schizophrenics: 33:045 
Note: These mean scores were obtained by pre-multiplying the mean scores for each group 
on the three centroid factors (see chapter by Slater for these scores) by the weights ob- 
tained in the discriminant function analysis for differentiating the psychotic from the 
DOn-psychotic group. 


Was, therefore, rotated so that it had two properties: (1) it minimized 
the differentiation between the two psychotic groups,” and (2) it now 


2 These weights are: factor I = 0:6736, factor II = 0:7390, factor III = — 0:0056. 
The two psychotic groups will still not be exactly equal in terms of their factor 
Scores. In a discriminant function analysis, a vector which maximizes the difference 
between the means of the groups is obtained when each within group dispersion has 
een made spherical (i.e. each group has unit variance in every dimension). This pro- 
sdure should not, strictly speaking, be used when the groups differ significantly in 
of i dispersion. When the groups differ with respect to their dispersion as in the case 
dei * present study (where the schizophrenics almost certainly have a larger standard 
ii ros on all three factors), the discriminant function only differentiates maxim- 
eu tween these ‘standardized’ scores with unit variance in all dimensions. How- 

T, these slight inaccuracies are probably not important. 
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passed through the mid point of the cluster of speed tests. This rotation 
was effected by putting it at 90 degrees to the factor of ‘overinclusion’. 
This rotated factor can be called a factor of ‘psychoticism’ since it still 
differentiates the psychotics from the other groups. Indeed it is, perhaps, 
a better factor of ‘psychoticism’ since both schizophrenics and depres- 
sives now have almost equally abnormal scores on it. However, an 
inspection of Fig. 1.7 makes it apparent that *his factor is now one of 
‘retardation’, as it is defined by a large group of speed tests. This 
‘retardation’ factor is shown in Fig. 1.7 by a broken line at right angles 
to the ‘overinclusion’ factor. The loadings of the tests on the factor of 
‘retardation’ are shown in Table 1.103 under the heading ‘Factor R’. 
The correlation between the *Overinclusion' factor and the ‘Retarda- 
tion’ factor isnow only —0-377so that they are morenearly independent.* 

The third factor, which has been labelled ‘General Intelligence’ was 
defined as that factor which is at right angles to the plane defined by the 
‘Overinclusion’ and the ‘Psychoticism’ (or the ‘Overinclusion’ and 
‘Retardation’) factors. In order to obtain the test saturations on this 
factor a set of three weights was computed which defines a vector at 
right angles in the three-dimensional space to both the set of discriminant 
function weights from which the *O' factor derived, and the set of dis- 
criminant function weights from which the ‘P’ factor derived.? The 
resultant set of three weights was then pre-multiplied into the row of 
factor saturations for the three centroid factors, for each test. The result- 
ant figure gave the saturation on this ‘G’ factor for each test. These 
saturations are presented in Table 1.103 under the heading ‘Factor G’. 
The ‘G’ factor is thus by definition that factor which does not differ- 
entiate the psychotic groups from one another, or from the combined 
neurotic and normal group. Since we know that the neurotics do not 
differ significantly from the normals, the ‘G’ factor cannot differentiate 
any of the groups significantly. This is as it should be, since the groups 
were matched for pre-illness intelligence level. 


(e) The Make-Up of the Factors 


The ‘Overinclusion’ Factor. It has already been shown that the ‘over- 
inclusion’ factor differentiates significantly (and maximally) between 
the schizophrenics and the depressives. An inspection of Fig. 1.7, makes 
it clear that as expected, most of the ‘pure’ measures of overinclusion 
have high loadings on this factor. Nine out of the fifteen relatively ‘pure’ 
measures of overinclusion have loadings on this factor greater than 0:25. 


1 Even though these factors are at right angles, they are still correlated. The only 
completely uncorrelated set of factors set at right angles, are the original, unrotated 
centroid factors—or more precisely, the original principal component vectors. How- 
ever, as was pointed out, these have no psychological meaning. 

2 These weights are: for factor I = + 0-7354, for factor II = — 0-6710, for 
factor III = — 0:0945. 
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The eight highest loadings on this factor are obtained by tests of *over- 
inclusion’. Furthermore, all eight were regarded as ‘pure’ measures 
except one (Variable 35—total time for the Shaw test) which, it was 
expected, would also be a measure of slowness (as is the case). These 
results appear to confirm the theory that these tests do indeed measure 
‘overinclusive’ thought disorder in common, and that this disorder is 
characteristic of schizophrenic patients. 

Of all the twenty-three measures of ‘overinclusion’ included in the 
factor analysis, whether or not they were regarded as ‘pure’ measures, 
only four produced loadings on the ‘overinclusion’ factor opposite to 
prediction. These four tests had very small loadings indeed. The average 
‘reaction time’ on the proverbs test (Variable 59) had a loading of 
+0:19, opposite to prediction. (Note that a negative loading indicates 
the ‘overinclusive’ end of the ‘overinclusion’ factor.) Thus, overinclusive 
subjects tend to have a slightly faster reaction time, in spite of the fact 
that they talk at greater length and use more words. The total time 
taken for Pathway I had a loading of +-0-07, and the number of errors 
on Pathway I had a loading of +0-06, both opposite to prediction, but 
both very small indeed. It will be recalled that neither score differentiated 
the groups significantly. It would appear, as suggested earlier, that this 
test is too simple to be a measure of ‘overinclusion’. The other score 
Which had an unpredicted loading, was the time taken for Pathway Il. 
It had a loading of only +-0-12. Again, this score did not differentiate 
the groups significantly. . 

It was expected that the 'concreteness' ratings for the Goldstein- 
Scheerer Colour-Form test (Variable 22), the Goldstein-Scheerer Object- 
Sorting Test (Variable 23), and the Benjamin Proverbs test (Variable 57) 
would be partly a function of ‘overinclusion’, because schizophrenics 
are rated ‘concrete’ on the basis of their unusual responses. This pre- 
diction was verified for each test. The Colour-Form test has a loading 
of —0:38 (‘concrete’ scores being more ‘overinclusive’), as expected. 
The Object-Sorting test rating has a loading of —0:15 (concrete scores 
being more ‘overinclusive’), and the Benjamin Proverbs test rating has 
a loading of +0-07 (‘abstract’ scores being less *overinclusive") as 
expected. 

Several of the miscellaneous tests produced interesting saturations on 
the factor of ‘overinclusion’. Overinclusive subjects on the word associa- 
tion test produce fewer synonyms (loading = -+0-18), more often repeat 
the stimulus word (loading = —0-14), give more ‘multiple responses’ 
(loading = —0-36), give more ‘aside remarks’ (loading = —0°45), and 
More often repeat a previously given stimulus or response word 
(loading = —026). This was exactly as predicted, on the grounds that 
Overinclusive subjects would generally tend to produce more ‘distant’ 
Or "unusual" responses, although none of these scores differentiated the 
Broups at an acceptable level of significance. 
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It is also of interest, that on the Luchins test, before they have been 
given the ‘set’, overinclusive subjects more frequently use the ‘simple’ 
method of solution (loading = +0-31). While this measure did not 
differentiate the groups significantly, it throws some doubt on the finding 
that schizophrenics tend not to develop the ‘set’ as expected from the 
theory. It is possible that they went into the test situation with an undue 
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FIG. 1.8. The Factors of *Overinclusion' and ‘General Intelligence’. 


preference for the simple method even before the ‘set’ had been given, 
although why this should be it is difficult to understand. One would 
expect that, if anything, ‘overinclusive’ people should prefer the more 
complex (three-jar) solution to the simpler (two-jar) solution before the 
‘set’ is given. 

For some reason or other, even though the groups were matched for 
vocabulary, overinclusive subjects tend to have higher Mill Hill Vocabu- 
lary scores (loading —0:37). 
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As might be expected, most of the measures of ‘overinclusion’ are also 
a function of General Intelligence. Fig. 1.8 shows the results obtained 
when the ‘overinclusion’ and ‘G’ factors are plotted together. 

On some tests, intelligence is positively correlated with overinclusion. 
For example, both intelligent and ‘overinclusive’ people give a larger 
number of ‘B’ responses to the Shaw test, as well as a larger total number 
of responses. On other fests, the correlation is negative. For example, 
both ‘overinclusive’ and duller people give a large number of ‘non-A’ 
Tesponses on the Object Classification test, and a large number of ‘C’ 
and ‘D’ responses on the Shaw test. Only two of the best measures of 
‘overinclusion’ are completely independent of intelligence. These are 
the average number of words per proverb (Variable 58) and the num- 
ber of unusual responses given on the Goldstein Object-Sorting test 
(Variable 25). However, it is obviously possible to assemble a battery 
of ‘overinclusion’ tests so that the effect of intelligence is balanced out. 

The factor of overinclusion was defined as that factor which differ- 
entiated maximally between the depressive and schizophrenic groups. 
This does not preclude the possibility that depressives are, nevertheless, 
More overinclusive than normals or neurotics. In Tables 1.104 and 1.105, 
the mean factor scores for all the groups have been shown for the ‘Over- 
inclusion’ and the original ‘Psychoticism’ factor. It can be seen that, as 
a group, the depressives are no more ‘overinclusive’ than the normals 
and the neurotics. 

While it is possible to compute precisely the mean factor scores for 
the different groups, it is extremely laborious to calculate the other 
parameters of the distributions of each group (see chapter by Slater). 
In order to estimate the range of variation on these factors, and in order 
to see how well a simple, practical combination of the tests would 
measure the factors and differentiate the groups, simple ‘factor scores” 
Were derived for both ‘Overinclusion’ and ‘Retardation’. No attempt 
Was made to weight any of the tests differentially. However, care was 
taken to ‘balance’ out the influence of the other factors in the con- 
Struction of each factor score. The factor score for ‘overinclusion’ con- 
Sisted merely of the sum of the transformed scores for the following 
tests: (This does not even weight the tests exactly equally. By and large 
the transformed scores have similar standard deviations, but this is 
Not exactly the case. Standardized scores could have been used, although 

1S 1s a more laborious procedure.) 


Variable 12: Object Classification test ‘non-A’ responses. ; 
Variable 16: Luchins test: Number of 'simple' solutions following 
Set’. 
Variable 21: Epstein test: Neologism score. 
Variabe 24: Goldstein Object-Sorting test ‘unusual’ responses. 
ariable 25: Goldstein Object-Sorting test ‘Overinclusion’ score. 
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Variable 31: Shaw test number of ‘A’ responses. 

Variable 32: Shaw test number of ‘B’ responses. 

Variable 34: Shaw test number of ‘D’ responses. 

Variable 37: Nufferno Level Test Score. 

Variable 53: Wechsler Picture Arrangement Test score. 

Variable 56: USES Combined Manual and Finger Dexterity score. 
Variable 58: Average Number of Words per proverb. 

Variable 66: Shaw test total responses. 

Minus Variable 42: Word Association test number of synonyms. 
Minus Variable 57: Proverbs Test ‘Abstraction’ score. 


The results obtained with this approximate factor score are shown in 
the histograms in Fig. 1.9. It can be seen that the depressives on this 
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FIG. 1.9. Overinclusion Factor Scores. 


score are all completely within the normal range. The means for the 
groups are as follows: Normals, 25:75, Dysthymics 24-20, Hysterics, 
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27:40, Depressives, 23-08, Schizophrenics, 32:12. (Note that not all the 
subjects are included. Subjects with incomplete scores were omitted, 
instead of making the appropriate mean adjustment as in the case of 
some other calculations.) The schizophrenics on this score are signifi- 
cantly differentiated from all the other groups combined, at beyond the 
1% level. While there is considerable overlap for this score, it must be 
remembered that no attempt was made to weight the tests in accordance 
with the extent to which they measure the factor. A more accurate 
factor score would have differentiated the groups better. 

The Factor of ‘Retardation’. This factor has two properties. It differ- 

entiates between the psychotics as a group, and all the other subjects, 
but it does not differentiate between the schizophrenics and the depres- 
Sives. It was expected that this factor would be a general measure of 
‘retardation’, and indeed it was rotated into this position (at right angles 
to the factor of ‘overinclusion’) partly because it would then pass 
through a large cluster of speed tests. The highest ten loadings on this 
factor are all speed scores of one sort or another. Twenty-five scores 
have loadings greater than --0:30 on this factor. Only six of these are 
Not speed scores. 
. The most striking characteristic of the ‘retardation’ factor, is that it 
1s perfectly general. There is no tendency for any special type of speed 
measure to have the highest loading. It is thus an average measure of 
intellectual, perceptual and motor speed, and can quite fairly be labelled 
a factor of ‘retardation’. The measures of overinclusion, based on speed 
Scores, tend, of course, to have the predicted loadings on the factor of 
Overinclusion’, but these speed scores have as high loadings on the 
factor of ‘retardation’ as any of the other speed scores. Fig. 1.7 
shows the factor of ‘retardation’ plotted against the factor of ‘over- 
inclusion’, Positive loadings indicate a ‘slow’ performance. All the speed 
test loadings on the ‘retardation’ factor are consistent. The few positive 
loadings in Fig. 1.7 result from speed scores in which a fast performance 
is given a high score. (On most of the measures, a slow performance is 
Blven a high score.) 

While a detailed theory was elaborated to explain the fact that 
measures of ‘overinclusion’ define a common factor, no such theory 
has been put forward to account for the factor of ‘retardation’. It was 
initially supposed that schizophrenics would not prove to be generally 
Tetarded. Depressives were expected to be generally retarded on the 
grounds that their depressive preoccupations would interfere with their 
Performance on all types of test, producing a general distraction. It was 
Supposed that schizophrenics would be slow only on intellectual tests, 
as a ‘by product’ of their overinclusion. This prediction is clearly not the 
Case. Schizophrenics are generally retarded on all types of speed test, 
and, as can be seen from Fig. 1.7, intellectual speed tests are no more 

ighly saturated on the ‘overinclusion’ factor than are motor and 
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perceptual speed tests. Nevertheless, the ‘retardation’ factor and the 
‘overinclusion’ factor correlate to the extent of —0-377. It was not 
possible to produce a factor of retardation with a lower correlation with 
the factor of ‘overinclusion’ defined. This is hardly surprising. In the 
original matrix of correlations, the speed tests tended to correlate with 
the measures of overinclusion, and it is not therefore possible to produce 
any linear combination of overinclusion scores which will be completely 
independent of a linear combination of speed scores. Why overinclusion 
tends to produce a slight general retardation is not entirely clear. It is 
possible, however, that *overinclusive* individuals tend to over compli- 
cate the instructions of even the simplest motor tests, paying attention 
to irrelevant details, and thus retarding their performance. In short, 
‘overinclusion’ is completely unrelated to the level of complexity of 
the task. 

This fairly low correlation between ‘overinclusion’ and ‘retardation’ 
nevertheless cannot entirely explain the fact that the schizophrenics, 
on average, show as extreme a degree of general retardation as do the 
depressives. Some other possible explanations for this will be con- 
sidered in the final section. 

An approximate factor score was calculated for the factor of ‘retarda- 
tion’, in order to estimate the ranges of variation on this factor for the 
different groups. This score consisted merely of the sum of the trans- 
formed scores for the following tests: 


Variable 10: Brengelmann Picture Recognition test. 

Variable 14: Object Classification test average response time. 
Variable 19: Epstein test total time. 

Variable 39: Nufferno Speed test Al. 

Variable 40: Nufferno Speed test B1. 

Variable 41: Nufferno Speed test A2. 

Variable 47: Babcock Motor Speed tests. 

Variable 50: Babcock ‘Substitution’ test average time per line. 
Variable 55: Drawing Design test total time. 

Variable 59: Reaction Time, Proverbs test. 

Variable 60: Proverbs test, average total time. 

Variable 61: Time required for Pathway I. 

Variable 63: Time required for Pathway II. 

Variable 65: Shaw test average ‘A + B' response time. 

Minus Variable 56: USES Manual and Finger Dexterity score. 


The results for this factor score are shown in Fig. 1.10. Note that, as 
with the comparable ‘O’ factor scores, some of the cases have been 
excluded for the same reason. 

The means for this score are as follows: Normals, 28-40; Dysthymics, 
37-10; Hysterics, 31-20; Depressives, 50:38; Schizophrenics, 52:12. The 
two psychotic groups combined differ significantly from the remaining 
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subjects at beyond the 1% level. This factor score for the ‘R’ factor 
correlated —0-139 with the similar factor score for the ʻO’ factor 
(‘overinclusive’ people being slightly faster). This result is in the 
opposite direction to that obtained when the exact correlation between 
*O' and ‘R’ was calculated. This is almost certainly because these factor 
score estimates are not based on all the tests, and because the tests used 
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FIG. 1.10. Retardation Factor Scores. 


are not weighted in accordance with their factor saturations. Too much 
attention should, therefore, not be paid to this less accurate correlation 
Coefficient. However, it does emphasize the fact that the *O' factor and 
the *R? factor are relatively independent, especially when only the best 
Measures of each are taken into account. 

It is interesting that although they have the same mean score, the 
depressives are more homogeneous on this score than are the schizo- 
Phrenics, More schizophrenics are within the normal range than are 
depressives, Of the three non-psychotic groups, the dysthymics have 
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the highest mean scores. The dysthymic group is of some interest, since 
eight of the twelve members of the group had depressive symptoms, 
seven being diagnosed as ‘reactive depression’. It is not surprising that 
they tend to be similar to the ‘endogenous depressives'. Indeed, it is 
surprising that speed scores differentiate these two groups as well as 
they do, especially in view of the fact that many of the doctors who 
selected these cases believed that the ‘endogencus’-‘reactive’ distinction 
was a difficult one to make. These findings throw some doubt on the 
notion that depressed patients are generally retarded because they are 
distracted by their depressive thoughts. Both ‘endogenous’ and ‘reactive’ 
depressives were equally depressed in this respect. To some extent, the 
discrimination between these groups on speed tests may be contamin- 
ated. One of the diagnostic criteria used by the doctors in judging 
between ‘endogenous’ and ‘reactive’ depression in the present study was 
the presence of ‘retardation’ which was judged an ‘endogenous’ feature. 
It is not surprising, perhaps, that speed tests should, therefore, differ- 
entiate the groups so well. However, the fact remains that a large group 
of depressed patients is extremely retarded on all measures of speed 
of functioning, and the present study throws little light on the cause of 
this. It might well have nothing to do with the worries and preoccupa- 
tions of the patients, since these features are common to all the 
depressives. It may have some underlying physiological correlates, and 
these may be the main difference between the ‘endogenous’ and ‘reactive’ 
groups. However, one can only speculate about what these physiological 
differences might be. 

The Factor of ‘General Intelligence’. This factor is very clearly defined 
indeed. On this factor, a negative loading indicates a positive correlation 
with intelligence, a positive loading, a negative correlation (see Fig. 1.8). 
Among the highest negative loadings are the Nufferno Level Test 
(—0-80), the Mill Hill Vocabulary IQ (—0:54), the Wechsler Arithmetic 
sub-test (—0-65) and the Wechsler Picture Arrangement sub-test (—0:55). 
Furneaux has found that intellectual speed, as measured by the Nufferno 
speed tests, has a correlation of about 0-40 with ‘level’ (which might be 
identified with ‘G’ in the present context). Nufferno Speed test Al has a 
loading of +-0:29, Nufferno test A2, a loading of --0:35, and Nufferno 
test BI, a loading of +0-12, almost exactly what we would expect to 
find on a factor of General Intellectual Level. (These loadings are 
positive, as log time scores were used in the correlations. A large score 
indicates a poor intellectual speed.) 

The most striking results obtained on this factor, are the loadings 
obtained by the five measures of ‘concreteness’. It was expected that 
these tests would to some extent measure ‘overinclusive’ thinking 
(because of the ratings of ‘concrete’ conventionally applied to some 
unusual methods of sortings produced by the schizophrenics), but 
would otherwise be measures of general intellectual level. Since this 
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variable was controlled, their *G' variance would not, of course, differ- 
entiate the groups. This was very clearly borne out by the factor analysis. 
The five measures of concreteness all have very high loadings on the ‘G’ 
factor as follows: Goldstein-Scheerer Colour Form ‘concreteness’ score, 
+0:57; Goldstein-Scheerer Object-Sorting test ‘concreteness’ score, 
+-0:58; Benjamin Proverbs test ‘abstractness’ score, —0-59; Shaw test 
number of ‘A’ responsos (normal, ‘abstract’ responses), —0-83; Object 
Classification test number of ‘A’ responses (normal, ‘abstract’ re- 
sponses), —0-68. These loadings suggest that these scores are just as 
good intelligence test items, as are the two Wechsler Bellevue sub-tests 
included. 

One other finding is of interest in connection with this factor. The 
most disappointing measure of ‘overinclusion’ used in the present study, 
was the Epstein test ‘overinclusion’ score. This score did not differ- 
entiate the groups significantly, and had a very low loading on the factor 
of ‘overinclusion’. However, it has a loading of +-0-70 on the ‘G’ factor. 
This suggests that on the whole, it is dull people who underline more 
words on this test (and to a very small extent, ‘overinclusive’ people). 
This is almost certainly a function of word knowledge. Dull people, who 
do not know what all the words mean, probably tend to guess, and thus 
obtain an unusually high ‘overinclusion’ score. If this is the case, the 
Epstein test could only be used as a measure of overinclusion with very 
bright people who are certain to have learned the meanings of all the 
words prior to the onset of their illness. While the ‘neologism’ score is 
a better measure of ‘overinclusion’, dull people also tend to underline 
neologisms, presumably not realizing they are not real words (loading 
= +0°54). 


VIII. SUMMARY AND DISCUSSION 


The present research investigated three main aspects of thought disorder 
a Psychotic patients, abnormal slowness of psychological function or 
Jetardation’, ‘overinclusion’, and ‘concreteness’. A group of tests of 
Psychoticism’ was also included, but of those which differentiated the 
groups significantly, all but one was a test of perceptual or motor speed. 
Five groups were tested, twenty normal people, twelve ‘dysthymic’ 
neurotics, eight hysterics, twenty endogenous depressives, and twenty 
Schizophrenics. The groups were carefully matched on measures of pre- 
illness general intelligence, age and education. 
"n. was found that most of the tests which were thought to measure 
Overinclusive’ thinking, intercorrelated significantly. Nearly half the 
tests discriminated the groups significantly. In each case, it was the 
Schizophrenic group which obtained abnormally high scores, the other 
Broups getting low scores. A factor of ‘overinclusion’ was obtained from 
an analysis of the matrix of intercorrelations. This factor was defined as 
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that rotated centroid factor which maximized the differentiation between 
the depressive and the schizophrenic groups. It was shown that most of 
the tests of ‘overinclusion’ had high loadings on this factor. Factor 
scores were derived for each group. Again, only the schizophrenic group 
obtained an extremely ‘overinclusive’ mean score. The depressives were 
no more overinclusive than the neurotics or the normal group. It could 
thus be concluded that ‘overinclusive thinking’ is an abnormality which 
characterizes only schizophrenic patients. 

The tests of slowness of psychological function also correlated highly, 
and produced a well-defined factor which was labelled ‘retardation’. 
This factor had a low correlation with the factor of ‘overinclusion’, 
‘overinclusive’ subjects tending to be slightly retarded. It was obtained 
in such a way that it differentiated the two psychotic groups significantly 
from all the other subjects. However, it did not differentiate significantly 
between the schizophrenics and the depressives. The striking feature of 
this factor is that it is measured by all the speed tests, regardless of their 
content. It is a combined measure of intellectual, motor and perceptual 
slowness, and was thus labelled ‘retardation’. It had initially been 
thought that slowness in schizophrenia was confined to intellectual 
tasks, as it was a by-product of ‘overinclusive’ thinking in this group. 
This hypothesis was clearly disproven. *Overinclusive thinking’ produces 
a slight amount of retardation on all tests, regardless of content. Quite 
unexpectedly, however, the schizophrenic group is just as retarded on 
average as the depressive group. This extreme degree of retardation 
cannot be explained as due to ‘overinclusion’. 

Three of the individual measures of ‘concreteness’ differentiated the 
schizophrenic group from the other groups. It was shown that this was 
due to the fact that on these tests, when the conventional system of 
rating ‘concreteness’ is adopted, many of the unusual responses given by 
overinclusive schizophrenics tend to be labelled ‘concrete’. Other 
measures of the ability to ‘abstract’ or ‘generalize’ did not differentiate 
any of the groups. It was shown that ‘concreteness’ scores are almost 
entirely a function of general intelligence. They would not, therefore, 
differentiate between any abnormal groups which are matched for 
general intelligence. In other words, there is no evidence at all that any 
psychotic illness can produce a specific disability which could be 
labelled ‘concreteness’. 

While these findings are consistent, and fairly clear as far as they go, 
they nevertheless raise a number of interesting problems. In the present 
study, the entire emphasis has been on the average characteristics of 
each group. In that sense, the psychiatric diagnostic label has been the 
main reference point. The underlying assumption has been that the 
labels used refer to more or less homogeneous groups. However, this is 
clearly not the case. Perhaps the most striking fact in the present study, 
is the heterogeneity of the schizophrenic group. If the histograms for the 
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various specific measures of ‘overinclusion’ are examined (Figs. 1.1, p.30, 
1.2, p. 34, 1.3, p. 37, 1.4, p. 46, 1.5, p. 53, 1.6, p. 58) and for the ‘over- 
inclusion' factor score (Fig. 1.9, p. 96), itis clear that the ‘schizophrenics’ 
are so heterogeneous that it is difficult to regard them as a single group. 
One fact that stands out clearly, is that their range extends from the 
bottom of the normal range, to the highest scores obtained on most 
tests. On average, roughly half the schizophrenic group is within the 
normal range, while half the group is outside the normal range. On 
some of the measures, the distributions are nearly bi-modal. Thus it is 
clear that only about half the present schizophrenic patients are ab- 
normally overinclusive. The other half are normal in this respect. It is 
highly likely that this type of thought disorder is related to a specific set 
of symptoms. As has been argued elsewhere (Payne, 1960) it is tempting 
to relate abnormal ‘overinclusion’ to the development of ideas of 
reference, and paranoid delusions. Both these symptoms could be 
regarded as unwarranted generalizations from the facts. Such patients 
often tend to perceive relationships where none exist, and to take as 
evidence for their views, data which most people would judge to be 
irrelevant. No attempt was made to investigate the relationship between 
test scores, and type of symptom in the present study. The cases were 
too few, and this is the aim of a further investigation. It was certainly 
the case in the present study, however, that the two individuals who 
obtained the most extreme scores on most of the overinclusive tests, 
were diagnosed as ‘Paranoid Schizophrenic’. If this were the case, a 
further interesting problem is raised. There is evidence in the literature 
(Payne, 1960) that paranoid schizophrenics are less slow, and show less 
general cognitive deterioration than other schizophrenic sub-groups. 
It is not commonly found that they are as slow as a typical group of 
depressives'. Nevertheless, in the present study, the schizophrenic group 
was on average, just as retarded as the depressive group. However, if 
the histograms for the ‘retardation’ factor scores are examined (Fig. 1.10, 
P. 99) it is clear that the schizophrenics are heterogeneous on this 
measure, too. The ‘endogenous’ depressives generally appear retarded. 
None of them get such fast scores as the fastest normals. However, à 
number of the schizophrenics are as quick or quicker than the normal 
average, just as a number are much slower than the endogenous depres- 
Sive average. It is tempting to speculate that it is the extremely over- 
Inclusive schizophrenics, who have ideas of reference and paranoid 
delusions, who are nearly normal with respect to speed of functioning. 
(Apart from slight slowness resulting from their overinclusion.) This 
Would be consistent with the fact that these two factors are virtually 
uncorrelated. 

In other words, it seems likely, that the ‘schizophrenic’ group can be 
Subdivided into two quite distinct groups. Those with ideas of reference, 
9r paranoid ideas, might be extremely overinclusive, but not much 
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retarded. The remainder of the group might contain a number of schizo- 
phrenics who are not overinclusive at all, but who are much more 
retarded than the depressive group. It is conceivable that this sub-group 
might have catatonic symptoms. 

These predictions were tested in part on the present data, using the 
‘approximate’ factor scores described, and plotted in figures 1.9 and 
1.10. No normal subjects obtained a ‘retardation’ score in excess of 48, 
while ten schizophrenics had higher retardation scores than this. These 
ten abnormally retarded schizophrenics were compared with the seven 
schizophrenics whose ‘retardation’ scores were within the normal range. 
The ten abnormally retarded schizophrenics had an average ‘over- 
inclusion’ score of only 27:2, nearly identical with the normal mean of 
25:8. Thus, the abnormally retarded schizophrenics are clearly not 
abnormally overinclusive as a group. On the other hand, the seven 
schizophrenics whose retardation scores were all within the normal 
range, had a mean ‘overinclusion’ score of 39-14. It is clearly the non- 
retarded schizophrenics who tend to be abnormally overinclusive in the 
present study. 

It is obviously necessary, not merely to relate these psychological 
abnormalities to psychiatric labels, but to discover how they might be 
related to specific symptoms, and what their prognostic significance 
might be. In this way, objective psychological research can aid psy- 
chiatry in altering its system of classification in a way which might make 
the aetiology of some conditions easier to understand, and the prognosis 
and response to treatment more clearly predictable. 
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THE EXCITATION-INHIBITION 
BALANCE IN NEUROTICS 


Gordon Claridge 


INTRODUCTION 


THE statement that almost any objective measure of behaviour we care 
to take will inevitably reveal wide individual differences in performance 
18 so commonplace that its reiteration here may seem scarcely necessary. 
Yet, in their search for general laws of behaviour, psychologists have 
tended to minimize the importance of such variations, feeling, no doubt, 
that Overemphasis on them may unduly restrict the generality of the 
Principles they have enunciated. To the personality theorist, of course, 
it is these very individual differences which are of central importance. 
Since any attempt to describe, classify, or measure personality is, almost 
by definition, itself an exercise in the analysis of individual variations in 
response. Only fairly recently in the history of psychology has there 
been, however, a realization that a useful link can be made between 
Personality factors and previously unexplained variations in performance 
on tests hitherto largely outside the immediate interest of the personality 
theorist, Furthermore, there is now increasing confidence that the 
Systematic study of this link between these two approaches to behaviour 
Fie i can become a means of furthering, rather than handicapping, 
€ search for general psychological laws. 

Whichever way we look at the problem, however, whether we are 
i dy interested in the determinants of performance of a particular 
ae or in seeking objective measures of the variables underlying person- 
à Y; It is clear that the isolated demonstration of a connection between 
n HEB performance measure and personality is of limited useful- 

SS unless it can be related to a broader theoretical framework, which 
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should, according to its comprehensiveness, be capable of taking account 
of a wide variety of apparently different measures of behaviour. Prob- 
ably the most comprehensive framework of this kind to date is the 
attempt of Eysenck (1955a) to link a number of personality dimensions, 
previously demonstrated by him, to some of the variables proposed by 
Hull (1943) to underly behaviour in general. By doing this, Eysenck has 
been able to account in a meaningful way for individual differences in 
performance using a variety of experimental techniques. In particular, 
his suggestion that the learning theory variable of reactive inhibition 
(Ig) is at least one of the factors underlying the personality dimension 
of introversion-extraversion has led to a number of direct tests of the 
hypothesis arising from this that extraverts will show a greater tendency 
to build up inhibition than will introverts. Similarly, according to the 
original dimensional analysis, neurotics of the hysteric type, being 
extraverted, should build up inhibition much more rapidly than neuro- 
tics of the dysthymic type, who are introverted. Tests of this general 
hypothesis have been made using conditioning techniques (Franks, 
1956), perceptual motor tasks, particularly the pursuit rotor (Eysenck, 
1956a), and short perceptual tests (Eysenck, 1955a). For a fuller account 
of the theory than can be given here and for experimental evidence 
relating to it the reader is referred, in addition to the original statement 
of postulates cited above (1955b), to Eysenck (1957). 

Although itis certainly possible that variables other than those relating 
to the inhibitory process emphasized by Eysenck may ultimately be 
shown to be responsible for the results of some of the studies quoted by 
him, on the whole it is probably true to say that the original assumption 
does, in fact, account for most of the findings in the most parsimonious 
way. It was, at any rate, within this general theoretical framework and 
with the specific hypothesis linking extraversion to inhibition as a 
starting-point that the investigation reported here was planned. The 
principal aim of the study was, in fact, to investigate the relationship 
between central inhibition and extraversion in a systematic fashion, 
using as wide a variety of experimental measures as possible. Predictions 
for most of the tasks used which were relevant to the inhibition hypo- 
thesis could be made with some confidence in the case of those subjects 
whose personality could be adequately defined within the two dimensions 
of neuroticism and introversion-extraversion, viz. hysterics, dysthymics, 
and normals. 

The inclusion of a group of psychotics introduced a secondary aim 
into the study, which was to investigate the performance of schizo- 
phrenics relative to that of normal and other psychiatric groups. Pre- 
dictions on the basis of an inhibition hypothesis alone could be made 
with less certainty in the case of this group, since it was here that there 
was the greatest possibility that other variables would be found to be 
operating. The analysis of psychotic performance in previous work has, 
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in fact, been approached from a number of different angles, the two 
most important of which are a formulation in terms of some kind of 
inhibitory process and an analysis along motivational lines. That ab- 
normally rapid growth rates of internal inhibition may be characteristic 
of schizophrenics during continuous work was, indeed, a possibility 
considered, for instance, by Venables and Tizard (1956a), and reasserted 
by the same authors (1956b) following an investigation of reaction time 
in psychotics. This is clearly in keeping with Pavlov's original theory of 
‘protective inhibition’ in schizophrenia (1941). However, other workers, 
such as Rodnick and Shakow (1940), have tended to stress the poor 
motivation of the schizophrenic in terms of his inability to maintain 
attention or ‘readiness to respond’ in reaction time situations. It is, of 
course, quite apparent that these two kinds of approach are not of 
necessity basically different or incapable of being handled simultaneously 
within the framework of contemporary learning theory, particularly if 
one accepts the view, which is well supported (Ammons and Ammons, 
1951; Kimble, 1952), that reactive inhibition is a central, rather than, 
as Hull himself thought, a peripheral phenomenon. The balance between 
Positive motivational level (D) and the central negative drive state (Ig) 
may be considered a critical determinant of measured performance, as 
the modification of Hullian theory by Kimble (1949), and Kimble and 
Shatel (1952), has indicated. Nevertheless, there were still, at the outset, 
at least two possible variables on which the schizophrenics might be 
differentiated—the basic level of positive drive and the degree of 
tendency towards negative drive, or both. It was clear, therefore, at the 
beginning of the investigation that a general hypothesis about the 
behaviour of schizophrenics was difficult, if not impossible, to formu- 
late, particularly in view of the relatively wide variety of experimental 
techniques employed. For that reason it was necessary to proceed on a 
Somewhat empirical basis as far as the psychotic group was concerned, 
making predictions where previous work suggested them or, if this were 
Dot possible, applying the measures in the hope that systematic findings 
would emerge which could be incorporated within the framework of 
Eysenck's behaviour theory and subsequently yield further hypotheses, 
Which could be tested at a later date. 

The choice of experimental tasks used in the investigation was 
Soverned by the desire to take a relatively wide cross-section of be- 
haviour, covering those areas which could be roughly designated as 
Perception, attention, and learning. Results from work in the fields of 
om wes and learning have, as we have briefly indicated above, 
eoe lent support to the general hypothesis put forward by Eysenck 
ib a relationship between inhibitory growth and extraversion. In fact, 
iius of the tasks used in the present study were those employed pre- 

Ously in testing this hypothesis. Their inclusion here allowed for the 
Possibility of extending some of the previous findings, particularly by 
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the use of neurotic groups where only normal subjects had been used 
before. In addition, it provided an opportunity of making comparisons 
with the performance of psychotics on the same tests. Further tests of 
a broadly perceptual nature were added, and the problem of attention 
considered by the inclusion of a test of the vigilance type, the nature of 
which appeared particularly applicable to the problem of inhibition. 
Apart from the light which, it was anticipated, would be thrown on 
Eysenck’s theory by each test taken individually, the application of a 
number of measures to the same population of subjects was, of course, 
expected to give valuable information about the generality of the inhibi- 
tion hypothesis as an explanatory concept for extraversion. 

The aims, then, of the investigation reported can be summarized as 
follows: (1) To investigate, on a number of different measures, Eysenck’s 
hypothesis that extraversion and central inhibitory growth are positively 
related. (2) To estimate the generality of this hypothesis. (3) To study 
the performance of schizophrenics relative to that of other groups of 
non-psychotic subjects and, consequently, to suggest, if possible, how 
such findings may be incorporated within the general framework of 
Eysenck’s theory. 


SELECTION AND DESCRIPTION OF SUBJECTS 


Four groups of subjects were included in the investigation. These were 
hysterics, dysthymics, schizophrenics, and normals. All the groups were 
selected from an Army population resident in a military psychiatric 
hospital! In the case of the two neurotic groups, all consecutive 
admissions were considered for testing. They were accepted (a) if, in 
the case-notes accompanying the patient, there was consistent agree- 
ment regarding diagnosis, and (b) if there was agreement, consistent 
with (a), among the psychiatrists within the hospital who had seen the 
patient. 

The hysteric group consisted of individuals whose principal condition 
included marked hysterical conversion symptoms (five cases), fugue 
states or simple amnesia (eight cases), or periodic blackouts known to 
be of hysterical origin (three cases). 

The dysthymic group was made up of individuals who complained 
variously of frequent headaches, gastric disturbance, and irrational 
fears, and who showed marked anxiety symptoms and signs of under- 
lying tension. No obsessionals were included in the dysthymic group. 

Similar criteria were adopted in the selection of the schizophrenic 


1 The writer is indebted to Colonel J. McGhie, Commanding Officer at the Royal 
Victoria Hospital, Netley, for his kind permission to test both patient and normal 
subjects under his charge. The research was carried out under U.S. Army Research 
Grant No. D4-91-591-EUC-1136, and the writer wishes to acknowledge his in- 


debtedness to the grant-giving body. 
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group, except that an additional condition of acceptance for testing was 
the degree of co-operativeness of the patient. Subjects were rejected who 
were deemed insufficiently co-operative to be likely to complete the 
rather lengthy test battery involved. No particular specification was 
made regarding the type of schizophrenic reaction, but three patients 
were diagnosed as paranoid, five as simple, eight as hebephrenic, none 
as catatonic. o 

All but two of the normal group were male nursing orderlies in the 
hospital wards or technicians in the pathological laboratory. Two sub- 
jects were non-psychiatric patients from the surgical ward convalescing 
from minor operations. 

There were sixteen subjects in each group and, apart from age, an 
estimate of intelligence, in terms of the Army SSG rating! was available 
for each subject. Mean ages and mean SSG ratings for the four groups 
are shown, respectively, in Tables 2.1 and 2.2. Analyses of variance 


TABLE 2.1 
Age 
Mean S.D. Mean S.D. 
Hysterics 22.84 7036  Schizophrenics 23:38 — 4609 
Dysthymics 2606 6957 Normals 2134 — 2469 
TABLE 2.2 
SSG rating 
Mean S.D. Mean S.D. 
Hysterics 300 1172 Schizophrenics 338 1525 
Dysthymics 28 0977 Normals 238 1042 


revealed that there were no significant differences between the groups in 
either age or intelligence level. 

Sometime during the testing session the Eand Nscales of the Maudsley 
Personality Inventory (MPI) were administered to each subject, who 
Was also asked to complete four scales (Hy, Pt, Pd, Sc) of the MMPI. 
The respective means for the four groups on each scale of these two 
Questionnaires are given in Tables 2.3 and 2.4. 

A brief comment should, perhaps, be inserted here with regard to the 

PI scores of the various groups, particularly the two neurotic groups. 
It can be seen from the mean scores for the latter groups that the 
dy: Sthymics are, as expected, excessively introverted as well as being, of 
Ceutte, highly neurotic. There is some discrepancy in the hysteric group, 


1 Thiis estinsate of tated ; " — le: 1, 2, 3P, 3M, 4, 
of intelligence gives a rating on a six-point scale: I, 4, 75, “V's 
E 5, the latter being fhe tones rating. For the purpose of calculation the ratings 
and 3M were called 3 and 4, the next two ratings thus becoming 5 and 6. 
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TABLE 2.3 
MPI 
E-Scale N-Scale 
Mean S.D. Mean S.D. 
Dysthymics 1219 4537 38-00 5-798 
Hysterics 2400 6595 32:62 11-115 
Schizophrenics 25-75 9-930 31:56 12420 
Normals 2619 7366 20-88 10:375 
TABLE 2.4 
MMPI 
Hy Pt Pd Sc 


Mean S.D. Mean S.D. Mean S.D. Mean S.D. 
Dysthymics 66:75 10:485 74:06 16:231 63:06 18-564 72:50 24:132 


Hysterics 62:25 14.089 65-88 17-148 63-62 12791 71-12 17-469 
Schizophrenics 60:31 13-937 1238 15-815 65-75 14-092 76:69 23-480 
Normals 51-69 9-698 57.18 13-404 53-12 11:336 54-50 13-883 


however, which emerges with a rather lower mean E-scale score than 
would be anticipated on the basis of Eysenck’s original predictions 
(1953b) with regard to neurosis. Hysterics appear, in fact, somewhat less 
extraverted than the normal subjects. That this discrepancy is not with- 
out a precedent is evidenced in the work of Sigal, Star, and Franks 
(1958a), who report similar results with an unselected sample of neuro- 
tics. The detailed implications of their findings, to be found in a 
discussion between the authors (1958b) and Eysenck (1958b), do not 
concern us here. We may simply note that, as far as the initial com- 
parisons between the two neurotic groups are concerned, it is necessary 
only that the hysterics be more extraverted relative to the dysthymics, 
which in fact they are in this case to a degree which is significant at a 
high level of confidence. This is not meant to imply that further con- 
sideration of this question may not be necessary when the overall 
relationship between inhibition and extraversion is evaluated, particu- 
larly with regard to the outcome of predictions about the relative 
performance levels of hysterics and the slightly more extraverted normal 
group. Indeed, the results of such predictions may be expected to throw 
some light on the role played, if any, by variables other than inhibition. 


A DETAILED CONSIDERATION OF THE TASKS USED 


In addition to the two questionnaires, nine? objective behaviour tests 
were administered to all subjects. Six of the tests were chosen as being 


1 The test battery consisted originally of eleven tests, two of which were not com- 
pleted by all subjects and were therefore not included in the final analysis. 
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directly relevant to the investigation of the inhibition hypothesis; that 
is to say, individual differences in performance on them could, it was 
predicted, be explained primarily in terms of differential rates of inhibi- 
tory growth. These six tests were named: Pursuit-rotor, Auditory 
Flutter Fusion (AFF), Archimedes Spiral, Time Error, Vigilance, and 
Time Judgement. The three other tests were less obviously related to this 
hypothesis, but provided objective criteria of neuroticism and psycho- 
ticism additional to those derived from the questionnaire measures. 
These were named: Body Sway, Sorting Test, and Picture Recognition 
Test. Each of these tests is now considered in more detail. 

Vigilance. Those tests which may be broadly subsumed under the 
heading of ‘vigilance’ seem, for a number of reasons, to be particularly 
Suitable for the direct investigation of the theoretical position taken 
here. In their simplest form, tests of this type require the subject to 
attend over a relatively long period to a monotonous stimulus pattern 
and to watch out for and respond to ‘signals’ occurring irregularly and 
at infrequent intervals. One of the main advantages, therefore, of this 
type of task is that a relatively lengthy sample of behaviour is forth- 
coming, in which performance is largely uncomplicated by the habit 
factor, since the simple responses involved are usually at an asymptotic 
level. 

This approach to the problem of attention was taken originally by 
Mackworth (1950), using a clock-watching task. His demonstration that 
decrement occurs in vigilance during prolonged observation fits well an 
interpretation in terms of inhibition. Since then a variety of vigilance 
Situations has been investigated and the relevant studies have recently 
been reviewed by Broadbent (1958). There appears to be general agree- 
ment among most studies that a decline in efficiency, reflected in a 
failure to detect signals, occurs over time, although a number of 
experimental conditions, which do not concern us here, are known to 

iminish or even prevent this decline. In general, it may be said that the 
Conditions under which the greatest decline in vigilance may be expected 
to occur over the shortest possible time would be on tasks where the 
Signals are infrequent and irregular, where they occur within the context 
Of similar neutral stimuli, and where external motivating stimuli are 
et to a minimum. Since a situation in which there was maximum 
Coeling in the shortest time was most suited to our purposes, a task was 
one which met these criteria most adequately. Before describing “a 

en, Used, however, brief consideration must first be given to the 
Seneral nature of vigilance and its relation to personality. 
mien limits a detailed analysis of vigilance decrement here, d 
predi er of points are worth noting. The assumptions underlying the 
inp m to be made here—viz. that decline in vigilance is due to 
;nniPÉOTy processes—almost certainly implies that both forms of 


inhibit; E 
nhibition—reactive and conditioned—are likely to play a part in 
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determining performance. The demonstration by Bills (1931), for 
instance, of ‘blocks’ or rest pauses in continuous work, reformulated 
in Hullian terms, has led to an analysis (e.g. Eysenck, 1956b) which 
suggests that such rest pauses serve the purpose of partially dissipating 
reactive inhibition which has hitherto accumulated. This dissipation of 
negative drive (Ip) acts as a reinforcement for the rest pause and so, in 
the usual way, for the formation of the habit of not responding or 
conditioned inhibition (sIg). It is very likely that such an analysis is 
applicable to the problem of vigilance, where such breaks in attention 
clearly reduce the probability of detecting signals. However, whatever 
the relative importance of these two variables, the general predictions 
about individual differences in vigilance remain unaffected, since a 
more rapid growth of reactive inhibition will lead, other things such as 
positive drive being equal, to an earlier onset, and subsequently greater 
frequency, of rest pauses. Accordingly, hysterics should show more 
decrement than dysthymics, and normal extraverts more decrement 
than normal introverts. 

There is available some evidence confirming the essential correctness 
of this suggestion, although all of it appears to have been gathered on 
normal subjects. The most relevant of these is a study by Bakan 
(1960; pers. comm.), using a task on which the one employed 
here was based. He showed that in the first thirty-two minutes of an 
eighty-minute vigilance task introverts were superior to extraverts. This 
difference disappeared when a secondary task was introduced, largely 
because extraverts benefited very much more than introverts from this 
condition. The latter finding would seem, incidentally, to fit an explana- 
tion in terms of disinhibition, since the extraverts, being more likely to 
have high levels of inhibition at any one stage in the task, would be 
expected to benefit most from the disinhibiting effects of the buzzer 
response required to the secondary stimuli. 

Other evidence supporting the viewpoint that vigilance decrement is 
greatest among extraverts, is found in a number of unpublished studies 
reported by Broadbent (1958), who also quotes his own finding that a 
measure of level of aspiration, assumed to be positively associated with 
introversion, is correlated to the extent of —0-46 with deterioration in 
performance on his twenty dials test. It is clear from the foregoing that 
it was possible to predict with a fair degree of confidence the vigilance 
performance of the normal and neurotic groups used here. 

The prediction was made, therefore, that hysterics would reveal on 
the vigilance task inferior performance to that of dysthymics, the 
normal group falling between the two neurotic groups. It was also pre- 
dicted that a negative correlation would be found between vigilance 
performance and extraversion in the normal group. Since, following 
Bakan’s procedure, a secondary task condition was also included in the 
test, the assumption that this secondary task acted as a disinhibitor led 
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to the further prediction that its introduction would produce a greater 
increase in the performance level of hysterics than of dysthymics, thus 
probably causing the difference between the two groups to disappear. 
Similar but less marked changes were also expected to occur within the 
normal group, extraverts improving more than introverts. 

Predictions concerning the performance of the schizophrenics on this 
task could be made with less-certainty, although clinical impression 
would doubtless lead us to the view that they would be inferior to normal 
subjects. The consistent findings, concisely summarized by Hunt and 
Cofer (1944), that the reaction time of schizophrenics is abnormally slow 
are relevant, since the reaction time and vigilance situations clearly have 
some elements in common. If, as we considered earlier, the schizo- 
phrenic’s reaction time is, indeed, a result of poor attention, as the work 
of Huston, Shakow, and Riggs (1937) and Rodnick and Shakow (1940) 
suggests, then it is not unreasonable to assume that the schizophrenic 
will perform badly on tests such as vigilance, which are essentially 
dependent on the ability to sustain attention. An analysis in terms of 
‘protective inhibition’ would also lead us to essentially the same view. It 
seemed reasonable to propose the hypothesis, therefore, that a group 
of schizophrenics would perform at a significantly worse level than 
normals on the vigilance task. 

It would perhaps be carrying the analogy between vigilance and 
reaction time too far to suggest that we could predict the outcome of the 
introduction of the secondary task in the case of the schizophrenic 
group, although mention should be made of the work of Tizard and 
Venables (1957), who have demonstrated that the reaction time of some 
schizophrenics can be decreased by the application of extraneous stimu- 
lation. It must be noted, however, that the authors have rejected an 
explanation of this in terms of disinhibition. Because of the lack of 
direct evidence on this point no specific prediction was made regarding 
the effect of the secondary task on the performance of the psychotic 
group. 

The vigilance task used was in two parts, the first consisting of a 
thirty-minute tape-recording of digits read out at the rate of one per 
second. Some departure from randomness was unavoidable, but the 
series did not differ noticeably from a random one. Interspersed in the 
series were a number of ‘signals’, which consisted of three successive 
odd digits. The signals occurred with a frequency of one per minute, the 
exact position of each signal within each minute being determined 
randomly. The mean interval between signals was 56 seconds, with a 
range from 3 seconds to 1 minute 41 seconds. During this part of the 
test the subject was instructed to listen particularly for the groups of 
three odd numbers, which, he was told, might occur at any time during 
the test. He was told to tap loudly on the table whenever he heard such 
a group. He was then given a short practice series and, if necessary, the 
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instructions were supplemented by further explanation until it was clear 
that the procedure had been understood. 

Immediately following the principal part of the test another series of 
digits was played, lasting this time for ten minutes. Here the subject was 
told to carry on as before but was also instructed to listen out for the 
number ‘six’ in addition to the groups of three odd numbers. He was 
asked to press a buzzer in front of him in respense to the number ‘six’ 
as well as tapping in response to the main signals. The latter were again 
interspersed with the same frequency as before, the mean interval 
between signals being 53 seconds, with a range from 8 seconds to 
1 minute 40 seconds. The secondary signals occurred much more 
frequently, the number ‘six’ appearing in the series once every seven 
seconds. 

During testing the tone and volume controls of the tape-recorder 
were adjusted to a standard setting for all subjects, the setting being 
determined beforehand so as to give a clear, audible reproduction. 
Throughout the test the experimenter sat in a position as unobtrusive 
as possible and marked off on a prepared scoring sheet all signals 
detected by the subject. 

The two test scores taken for the initial analysis of results were the 
total numbers of primary signals detected in each part of the test. Totals 
were also taken for each five-minute period of the test. 

Group means for the total score measure in each part of the test are 
shown in Table 2.5. Analysis of variance revealed that there was an 
overall significant difference between the four groups in both parts of 
the test, the confidence level reached in each case being 0:001. 


TABLE 2.5 
Total vigilance score 
First part Second part 

Mean S.D. Mean S.D. 
Normals 21:06 4:02 7:69 213 
Dysthymics — 2475 4:32 5:69 1-83 
Hysterics 15:44 6:09 5.19 2:63 
Schizophrenics 1275 11-62 419 411 


The striking differences between the four groups, shown in the total 
mean scores, are also reflected in their respective performance levels 
throughout the test, as revealed in Fig. 2.1. It is clear from this figure 
that, as expected, dysthymics were superior at all stages in the first part 
of the test, while hysterics performed at an inferior level throughout, 
again as predicted. Psychotics also performed badly, although here 
there were much larger individual differences than in the other groups, 
as the standard deviation shows. In terms of total score during the first 
part of the test, dysthymics were significantly better than normals 
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(t = 24117, p < 0:02), while the normals were significantly better than 
the hysterics (t = 2-9946, p < 0:01). 

A product moment correlation calculated for the normal group 
between total vigilance score and the E-scale score of the MPI had a 
value of + 0-19, which was not significant, but in any case was in a 
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Fic. 2.1. Vigilance: Mean Performance for each Group in each five-minute 
Period of the Task. 


direction opposite from that predicted. Further inspection of the data 

suggested that the total score measure of vigilance taken initially, while 

sufficiently sensitive to confirm the overall group differences, was 

actually a relatively crude one reflecting, as it did, two distinct aspects of 

the task, viz. performance level and rate of fall-off in level. In order to 

separate these two components linear regression lines were fitted to the 
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data for each subject. This gave two measures, the coefficient ‘b’ repre- 
senting performance level (strictly starting level) and the coefficient ‘a’ 
representing mean change per trial, which, in the case of negative values 
of ‘a’, may be interpreted as ‘rate of fall-off’. The mean values for these 
two coefficients for the four groups are shown in Table 2:6. 


TABLE 2.6 


Vigilance regression coefficients 


Mean S.D. Mean S.D. 


Normals —0119 0:304 393 1106 
Dysthymics --0:007 0:236 410 1-644 
Hysterics —0297 0:353 3:61 1-621 
Schizophrenics —0-057 0-182 2:32 2:033 


It may be noted in passing that both measures confirm the inter-group 
differences demonstrated previously. Our immediate concern being 
with the normal group only, however, correlations were calculated 
between the E-scale score and the two coefficients for this group. The 
correlation between fall-off (‘a’) and E score was --0:45 and between 
Escore and level (‘b’) —0:31. Both of these just failed to reach signifi- 
cance at the 0:05 level for sixteen subjects, but the direction of the cor- 
relations was suggestive that there was a tendency for introverts to 
start at a higher level than extraverts but to fall off more. The situation 
is more clearly shown in Fig. 2.2., where the performance curves for the 
six most introverted and six most extraverted normals are compared 
with those of the two neurotic groups. It can be seen from this that 
hysterics and normal extraverts and dysthymics and normal introverts 
showed the expected similarity only in so far as they started at com- 
parable levels. It may be anticipated that this finding is of some signifi- 
cance, but a discussion of it is delayed until a later section, when à 
general interpretation of the results is presented. 

The changes consequent upon the introduction of the secondary task 
are shown clearly in Fig. 2.1 and in Table 2.5. The hysteric group showed 
a significant increase in performance level, t being 2-9213, p < 0-02, while 
the dysthymic group showed a significant decrease in level, ¢ equalling 
9.3258, p < 0-001. The results of this change were that the previous 
significant difference between the two neurotic groups disappeared, as 
predicted, t being 0-609 (NS). The change in the normal group Was 
significant at the 0-1 level only, t being 1-9211, and the relationship of 
this change to extraversion was opposite to expectation but not signifi- 
cant, r equalling —0:17. The psychotic group showed no change in 
performance level. 

1 For this purpose the data as exemplified in Fig. 2.1 were used, i.e. there were six 
‘trials’, each representing a five-minute period. 
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In general, then, it may be said that the vigilance test differentiated 
between normal, neurotic, and psychotic subjects in a predictable 
manner, thus lending overall support to the inhibition hypothesis which 
led to its use. 

Pursuit rotor. A further approach to the problem of inhibition, mainly 
reactive inhibition, is found in work with continuous motor tasks such 
as the pursuit rotor. The assumption that the phenomenon of reminis- 
cence, which can readily be demonstrated on tasks of this kind, is 


5 


DYSTHYMICS 


NORMAL EXTROVERTS 


3 NORMAL INTROVERTS 


HYSTERICS 


SIGMALS DETECTED 


FIVE-MINUTE PERIODS 


FIG. 2.2, Vigilance: Comparison for Single Task Condition between Neurotic 
Groups and six most Extraverted and six most introverted Normals. 


directly due to the dissipation of reactive inhibition has led to a deduc- 
tion, which Eysenck (1956b) has confirmed with normals, that extraverts, 
accumulating more inhibition, should show more reminiscence follow- 
ing a rest than will introverts. Following this finding the prediction was 
made, therefore, that in the present study hysterics would show greater 
amounts of reminiscence than dysthymics, with normals intermediate. 
Within the normal group reminiscence and extraversion were expected 
to be positively correlated. i . - 
As with the vigilance test, the performance of schizophrenics, parti- 
cularly in respect of reminiscence, was difficult to anticipate. The 
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expectation that their general level of performance would be lowered 
relative to that of normal subjects was certainly sustained by the study of 
pursuit rotor learning in psychotics by Huston and Shakow (1948), who 
showed that schizophrenics were very inferior on a task similar to the 
one used here. These authors did not, however, specifically investigate 
reminiscence, although they did find incidentally that retesting after an 
interval of three months resulted, in the schizophrenic group, in a slight 
increase analogous to a reminiscence effect; normals, by comparison 
starting the second session lower than the level they had reached at the 
end of the first. It is difficult to estimate the significance of these findings 
and, in any case, the testing conditions are not sufficiently comparable 
to those obtaining here to provide the basis for a definite prediction 
about reminiscence in the psychotic group. The most appropriate hypo- 
thesis in this instance seemed, therefore, to be the null one that schizo- 
phrenics and normals would not differ essentially in reminiscence. 

The pursuit rotor apparatus is too well known to require detailed 
description, and a full account of the instrument used and instructions 
given is contained in an unpublished thesis by Star (1957). Briefly, it 
consists of a rotating bakelite turntable, inserted near the periphery of 
which is a small metal disc which rotates with the turntable at a speed 
of one revolution per second. The subject is instructed to keep the tip 
of an articulated rod in contact with the small disc as it rotates in front 
of him. Contact between the disc and the metal tip of the rod activates a 
recording apparatus consisting of two electric clocks, which come into 
circuit successively every ten seconds. This enables a continuous record 
to be taken of the amount of time during each ten seconds that the 
subject is on target. The measure is normally, as here, converted to a 
measure of percentage time on target. 

The schedule followed here consisted of three five-minute practice 
periods each being separated by a ten-minute rest pause. During the 
rest pauses all subjects were asked to complete the two questionnaires 
used in the investigation. In addition to the individual scores for each 
ten-second period of the three phases a reminiscence score was calcu- 
lated for each subject. This was done by subtracting the mean of the 
last three scores before the rest period from the first score after the rest 
period. Two such reminiscence measures were, of course, taken. 

Fig. 2.3 shows the performance curves of the four groups in the three 
phases of the test. Analysis of variance carried out for the three phases 
indicated that there was an overall significant difference (p < 0-001) in 
performance between the four groups. Numerically, the performance 
measures for the groups are most concisely represented in the coeffi- 

cients for linear regression, which were ultimately computed on the data 
for each subject in each phase of the test. The group mean values for 
these coefficients are shown in Table 2.7. The aspects of the performance 
curves which we may note now, but which will concern us in more 
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TABLE 2.7 
Pursuit rotor regression coefficients* 
Phase I Phase II Phase III 
wo Wy D» P w p 


Mean 02 788  —087 2780 —138 3786 


Norm: 
a S.D. 0:436 7-205 0:982. 9:274 0:815 12335 


D ; Mean 034 791  —063 2466  —081 3290 
ysthymics SD. 0-541 6670 0995 12510 0912 14066 


Tater Mean 035 418 —050 1866 —107 2719 
TS S.D. 0342 5699 0601 12193 0:969 16466 


Mean 0:67 3:68 013 19:89 —015 2555 
S.D. 0-663 6:508 0-767 13:946 0-817 14-948 


" 
‘a’ refers to ‘slope’ or ‘average increment per trial’ and *b' to ‘performance level’. 


Schizophrenics 


defail later, are the low and high performance levels of hysterics and 
ysthymics respectively, and the rather unusual performance curves of 
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the psychotic group, particularly during the first phase of the test. Our 
main predictions, however, were made with regard to the reminiscence 
scores, known respectively as RSI and RS2. The group mean values for 
these two measures are shown in Table 2.8. While the groups were found 
on analysis of variance to be significantly differentiated at the 0-001 
level, the direction of the group differences was, as can be seen from 
Table 2.8, somewhat different from that predicted. Although dysthymics 


TABLE 2.8 
Pursuit rotor reminiscence scores 
RSI RS2 
Mean S.D. Mean S.D. 
Normals 13-7 8-51 14:8 11:53 
Dysthymics 83 7:91 62 1407 
Hysterics 8-0 9-76 12:3 9-67 
Schizophrenics 1-9 7-62 3:9 13:53 


followed the prediction that they would show less reminiscence than 
normals, hysterics failed to confirm our expectation that they would 
have higher reminiscence scores than normal. Dysthymics and hysterics 
were virtually identical on the RSI measure, while the difference in RS2, 
although in the right direction, is not significant. Within the normal 
group the correlations between reminiscence and extraversion were not 
significant, the values for r between RS1 and RS2 and the E scale being, 
respectively, —0-12 and —0-09. One notable feature, and the source of 
the overall significant difference, is the low degree of reminiscence 
shown by the schizophrenic group, the differences between the latter 
group and the normal group being significant for both RS1 and RS2 
(t equals 4:0689, p < 0-001 and 2:4222, p < 0:05, respectively). While not 
anticipated, this result appears to merit further discussion which will be 
taken up later when other, essentially similar, findings, obtained con- 
temporaneously with our own, will be presented. 

Archimedes Spiral. The applications, on the one hand, of the inhibi- 
tion hypothesis to the explanation of the learning process and, on the 
other, of the satiation hypothesis to perceptual phenomena, such as 
figural after-effects, have until recently tended to proceed somewhat 
independently of cach other. There is, however, increasing agreement 
that inhibition and satiation are essentially similar in effect and due to 
the same underlying process (Duncan, 1956), in so far as satiation, like 
inhibition, may be thought to arise centrally as a result of afferent 
stimulation and to cause a temporary resistance to the continuation of 
such stimulation. One method of proof of this hypothesis is the differ- 
ential one suggested by Eysenck (1957) whereby, either correlations are 
demonstrated between measures of inhibition and measures of satiation 
gathered on the same population, or measures of each are shown to 
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correlate with some mediating variable such as extraversion, which, 
because of its positive relationship with inhibition, should also correlate 
with the degree of satiation shown on perceptual tasks. Proof of the 
latter kind is exemplified by Eysenck’s demonstration (1955a) that 
introverted neurotics show less kinaesthetic figural after-effect than 
extraverted neurotics. Differential proof of both kinds would be ex- 
pected to be forthcoming from the present study where a number of 
measures relevant to both the inhibition and satiation hypotheses were 
taken on the same groups of subjects. 

The particular phenomenon known as the Archimedes spiral after- 
effect falls into a class of illusions in which prolonged observation of a 
moving stimulus, in this case a rotating spiral, results in a marked after- 
image when the stimulus is terminated. This after-image takes the form 
of apparent movement in the direction opposite to that obtaining during 
the original stimulation. One explanation of this in terms of satiation, 
suggested by Eysenck (1957), is that ‘the original stimulation sets up 
certain unspecified cortico-neural events which are perceived as the 
illusion. According to satiation theory, these cortico-neural events must 
produce inhibition in the structures mediating the effects, so that 
eventually the after-image is brought to a stop.’ A more consistent 
explanation, possibly, is that the original stimulation itself produces 
inhibition in addition to the excitatory state set up simultaneously. The 
after-effect being, it is assumed, dependent on the strength of excitation 
at the point when the stimulation is terminated, the length of the sub- 
Sequent after-image would then be determined by the balance, at that 
point, between the excitatory and the opposing inhibitory potentials. 
Whichever explanation is correct, the prediction would be the same, 
that the length of the after-effect found would be negatively correlated 
with extraversion. Work with this test on the brain-injured, notably 
that of Price and Deabler (1955) and Gallese (1956), and to some extent 
that of Holland and Beech (1958), tends to confirm this, in so far as 
these subjects showed diminished after-effects consistent with the extra- 
verting effects (Petrie, 1952) of brain damage. The authors of the first 
of these two papers also agree that functional psychotics show after- 
effects which do not differ from normal. 

It was predicted here that hysterics would show significantly shorter 
after-effects than dysthymics and that, consistent with this, there would 
be negative correlations within the normal group between length of 
Spiral after-effect and extraversion. It was further hypothesized that 
schizophrenics would not differ essentially from normals in the length 
of their after-effect. 

The apparatus used consisted of a single-throw 180 degree spiral 
Mounted on a spindle driven by an electric motor, the speed of which 
could be accurately controlled by means of a strobe attachment. A 
reversal switch allowed the spiral to be rotated in both a clockwise and 
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an anticlockwise direction. The test was administered in a partially 
darkened room and the spiral illuminated by a 100-watt bulb suspended 
about nine inches above it. Strict control of illumination was difficult to 
obtain and this, and other factors such as angle of regard, were not 
rigidly controlled, but, in any case, variables other than the length of 
stimulation have been shown (Holland, 1958) to have a minimal in- 
fluence on the spiral after-effect. The subject was seated about four feet 
in front of the apparatus and asked to fixate the centre of the spiral 
while it was rotating. He was told to continue fixating after the spiral 
was stopped by the experimenter, and to report immediately the after- 
effects of it ceased. The length of the after-image was measured with a 
stop-watch. Four trials were given in all, alternately clockwise and then 
anticlockwise. In each trial the spiral was rotated for one minute and 
a rest of one minute was given between trials. Four scores were, there- 
fore, obtained from each subject. 

Analysis of variance indicated, in addition to a significant overall 
difference between groups (p < 0-001) that there was no difference in 
after-effect score due to direction of rotation. The mean of four trials 
was, therefore, taken as the measure of performance on this test, the 
group means for this measure being shown in Table 2.9. 


TABLE 2.9 
Spiral after-effect 
Mean S.D. 
Normals 10-16 4:560 
Dysthymics 15-78 5:486 
Hysterics 9-74 5:337 
Schizophrenics 14-78 6:996 


As predicted, hysterics showed a shorter after-effect than dysthymics, 
the difference being significant (t equals 3-0659, p < 0:01). Dysthymics 
had a longer after-effect than normals, again as predicted, t being equal 
to 3:0504, p — 0:01. Hysterics did not differ markedly from normal in 
length of after-effect, although, perhaps surprisingly in view of the 
previous results quoted, the test differentiated between the normal and 
schizophrenic groups (t equals 2:1488, p « 0:05), the latter having à 
longer after-effect than normal. Again it must be noted in this respect 
that, as with most other measures taken, the standard deviation for the 
psychotic group was somewhat larger than that for other groups, 
indicating a lack of consistency in the behaviour of schizophrenics as à 
group. This is not surprising in view of the wide variety of clinical 
reactions subsumed under this heading. 

Within the normal group the length of after-effect was found to be 
significantly related to extraversion, r between the E score and length 
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of after-effect being —0-596, which for sixteen subjects is significant at 
the 0:01 level. This finding, together with the inter-group differences out- 
lined above, thus provides fairly substantial support for most of the 
hypotheses proposed with regard to this test. 

Time Error. The presence of a particular constant error, known as 
time error, in judgements of sucessively presented stimuli has concerned 
psychophysicists for many years. Briefly, this error takes the form of a 
systematic tendency to overestimate or underestimate the magnitude of 
the second of two physically identical stimuli occurring successively. 
The effect has been investigated, although not always demonstrated, 
using a variety of stimulus material, including the pitch and intensity 
of sound (Postman, 1946), light intensity (Kreezer, 1938), hefted weights 
(Guilford and Park, 1931), and visual extent (McClelland, 1943). In 
general, though not universally, the findings have been that (a) negative 
time errors—the tendency to overestimate the second of a pair of stimuli 
—tend to predominate over positive ones; and (b) the tendency towards 
negative time error increases as the interval between the two compared 
stimuli increases (Kohler, 1923; Needham, 1935). 

A number of theoretical explanations, none entirely satisfactory, have 
been proposed for this constant error and these have been adequately 
reviewed by Osgood (1953). The investigation of the problem here 
derives from its relevance to the satiation, and therefore the inhibition, 
hypothesis, following Kohler’s original trace theory of time error. An 
analysis along these lines would be formulated as follows. The applica- 
tion of the first of a pair of stimuli, say a tone of a certain intensity, will 
Produce, in addition to the expected excitatory effect, an inhibitory 
Process opposing the continuation of this excitation. A following tone 
of equal intensity will then be judged louder according to the degree to 
which the inhibition consequent upon the first tone has ‘damped down’ 
the original excitation. One of the factors determining this would be, as 
the evidence confirms, the duration of time before the second stimulus 
I$ presented or, as it is usually called, the S-V interval. One difficulty 
with this general explanation of time error as it stands is the demonstra- 
tion in some studies that where the S-V interval is very short, say up to 
two seconds, positive or zero errors may tend to replace the more usual 
Negative errors. This is so in the case of judgements of sound intensity 
(Postman, 1946), with which we are particularly concerned here. A 
Possible explanation of this is that, when the second stimulus follows 
ttie first very rapidly, the original inhibition induced tends to oppose 
what is effectively a continuation of the first stimulus. In order to be 
Judged equal, in fact, the second stimulus would then have to be of a 
Physically greater intensity, thus giving rise to positive time error. 

In addition to the general influence of the S-V interval on the degree 
ee time error it would be expected, if our argument so far is correct, 
that individual differences would appear and that these would be 
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related to extraversion. Extraverts, producing more inhibition as a 
result of the first stimulus, would be expected to show a greater tendency 
than introverts to overestimate the intensity of the second of two 
stimuli. Accordingly the general prediction was made here that hysterics 
would show a greater tendency towards negative time error than 
dysthymics, while normals would tend as a whole to fall between these 
two groups of subjects. It was also predicted that extraversion and 
tendency towards negative time error in normals would be positively 
correlated. The appearance at the shorter S-V intervals of positive time 
error in some subjects was not unlikely, although exact predictions 
about group differences in this respect could not confidently be made. 
It could, however, be predicted that there would be a general tendency 
in all groups for negative time error to increase as the S-V interval 
increased, and it was expected that this change would be greatest 1n 
hysterics and least in dysthymics. g 

No specific prediction was made with regard to the schizophrenic 
group, although the ‘protective inhibition’ approach to schizophrenia 
would lead us to expect larger negative time errors in these subjects 
than in normals particularly if, as was the case here, the stimuli involved 
are relatively intense. 

The stimulus material used in the investigation of time error con- 
sisted of a tape-recording of pure tones, all of a frequency of 1100 c.p.s. 
The material was divided into four series corresponding to the four S-V 
intervals of 1, 2, 4, and 6 seconds. The standard tone, which was always 
presented first, had an intensity of 75 decibels and a duration of 1 second. 
Within each series there were five variable tones, all of one second 
duration and with respective intensities of 73, 74, 75, 76, and 77 decibels. 
Each of these was paired ten times with the standard in each series, the 
order of presentation of pairs being determined randomly. There were 
thus fifty judgements to be made in each of the four series, with an 
interval of six seconds between pairs and a rest of five minutes between 
series. The stimuli were presented to the subject through head-phones 
and the volume of the tape-recorder was adjusted to the correct setting 
by means of a calibrated meter. 

The subject was instructed to decide as quickly as possible whether 
the second sound of each pair was louder or softer than the first sound, 
no ‘equal’ judgements being allowed. He was then asked to encircle the 
appropriate response on the answer sheet. The measure of time error 
taken was calculated according to the following formula: 


(Number of judgements "softer —number of judgements ‘louder’) x 100 
Total number of judgements 


There were, of course, four such measures for each subject. 
The average degree and direction of time error shown by the four 
groups at each S-V interval can be seen in Fig. 2.4. While, as expected, 
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there is in all subjects a general tendency towards negative, rather than 
Positive, time error, this tendency can be seen to be greatest among 
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a stetics and psychotics and least among dysthymics, with gome 
termediate, It can also be seen that, confirming our predictions an 
Previous work, the time error effect becomes more negative the longer 
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the S-V interval and this occurs to a differential extent among the four 
groups, which tend to diverge. The qualification to this statement is, of 
course, that at the six-second S-V interval all groups, except the psycho- 
tics, reveal a movement away from further negative time error and tend 
to approach a level of zero error. The explanation of this is unclear, 
although one possibility is that, at this and longer time intervals, the 
interaction between excitatory and inhibitory processes, which we have 
proposed to underly this phenomenon, becomes minimal and judge- 
ments tend to be made on a random basis. An interesting possibility, 
which might support this, is the differential reaction shown by the 
psychotic group. If, as we consider earlier, ‘protective inhibition’ under- 
lies the schizophrenic’s performance on this test and if, as has recently 
been suggested by Venables (1959), it is the rate of dissipation of this 
inhibition which is critical in schizophrenia, then it would not reach a 
minimal level until an S-V interval is reached which is longer than that 
required to show the zero effect in normals. The meaningfulness of such 
an interpretation must, however, await further experimentation. 


TABLE 2.10 
% Time error measure 
Mean S.D. 
Normals —344 8:661 
Dysthymics —0:56 10-868 
Hysterics —10-78 11:306 
Schizophrenics — 12:25 15:237 


A two-way analysis of variance revealed that the four groups were 
significantly differentiated at a confidence level of 0-001. A single measure 
of time error was then obtained by taking the mean of the values for the 
four S-V intervals. The group means for this measure are shown in 
Table 2.10. A number of calculations based on this measure revealed that 
dysthymics and hysterics were significantly differentiated (t equals 
2:5297, p < 0-02), as were normals and hysterics (t equals 2:0618, 
p < 0:05). Normals and psychotics were differentiated at the 0-1 level 
only (t equals 2:0109) while normals and dysthymics did not differ 
significantly in mean time error. The correlation between the time error 
measure and E score for the normal group had a value of —0-24, which, 
while not significant, indicated, as predicted, some tendency for the 
more extraverted in this group to show greater negative time error. 

Time Judgement. All of the tasks considered so far involve the 
measurement of some kind of performance change due, it is hypothe- 
sized, to the temporal growth of inhibition consequent upon particular 
kinds of positive test stimulation. Measures of a rather different sort, 
however, might be expected to give an indication of the state of the 
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subject with respect to the inhibitory process. If, for instance, we take 
a task in which the subject is asked to reproduce unfilled time intervals 
it might be expected that satiation arising from the limited stimulation 
available will be one of the factors determining his degree of under- or 
over-reproduction of the interval. In mentalistic terms the subject will 
become more or less rapidly bored, the rapidity with which this occurs 
being determined, we would suppose, by the position of the subject on 
the extraversion-introversion continuum. What might be thought to be 
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FIG. 2.5. Time Judgement—Mean Error of Estimation for each Group at four 
Time Intervals. 


Such behavioural concomitants of an underlying inhibitory process has 
certainly been associated in a rather different situation with extraversion 
(Wyatt and Langdon, 1937). Applying this to the present problem, it 
Seems possible that the individual differences which have been noted in 
time perception (Woodrow, 1933) may be meaningfully analysed within 
the present theoretical framework. . : . 

In order to investigate this a simple test was designed in which the 
Subject was asked to reproduce a number of time intervals of varying 
duration. These were 10, 30, 60, and 90 seconds, each being presented 
twice. The beginning and end of each time interval was defined by the 
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experimenter tapping on the table with a pencil. The subject was 
instructed to tap a third time when he thought the same amount of time 
had passed following the experimenter’s second tap as had elapsed 
between the two taps. The time allowed to elapse before the subject’s 
response was measured with a stop-watch and the degree of error 
calculated by subtracting the actual from the estimated time. A negative 
value thus indicated under-reproduction and vice versa. The mean of 
the two values for each of the four time intervals was taken as the 
measure of response for that interval. During testing the subject was 
separated from the experimenter by a low screen and his watch removed. 
He was asked specifically not to count or try to ‘measure’ the time 
actively, but simply to try and imagine what the intervals were. 

It was predicted that hysterics would show a greater tendency towards 
under-reproduction of the intervals than would dysthymics and that 
the more extraverted normals would behave more like hysterics, while 
the introverted normals would behave like dysthymics. No specific 
prediction was made regarding the schizophrenic group. 

The overall differences between the groups on this test were not 
significant, the F-ratio just failing significance at the 0:05 level. Results 
were, however, roughly in the direction predicted, as shown in Fig. 2.5. 
Hysterics showed greater tendency to underestimate the time intervals 
at all levels than did dysthymics, although the latter, contrary to 
expectation, tended to show more error than the normal group. The 
group means for a single measure of time judgement—the mean of four 
time interval scores—are shown in Table 2.11. 


TABLE 2.11 
Time judgement measure 
Mean S.D. 
Normals —211 6:641 
Dysthymics —5:07 5:157 
Hysterics —7-01 6:931 
Schizophrenics —6-38 12:713 


Within the normal group correlations indicated that, with the excep- 
tion of the 90-second time interval, extraverts tended to underestimate 
the intervals to a greater extent than did introverts. The correlations 
were as follows: 


*r' between E-scale and error 
10-second interval: —0:51 p 0:05 (N = 16) 
30-second interval: —0:66 p 0:01 (N — 16) 
60-second interval: —0:51 p 0:05 (N — 16) 
90-second interval: —0-01 n.s. (N = 16) 
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The correlation between the overall measure and E-scale had a value 
of —0:497 which is significant at the 0:05 level. 

Auditory flutter fusion. The relevance of auditory flutter fusion to the 
theoretical position taken here arises essentially from the assumption 
that individual differences in the perception threshold for auditory 
flutter are explicable in terms of differential rates of growth and dissipa- 
tion of the inhibition arising as a result of stimulation by a sound signal. 
The exact determinants of this threshold have been suggested in the 
work of Miller (1948), Miller and Taylor (1948), and Symmes, et al. 
(1955). Their analysis and its relevance to Eysenck’s theory have been 
discussed in detail by Holland elsewhere in this book, and its repetition 
here will be confined to a brief outline of the main theoretical points. It 
has been suggested by the above authors that in order for flutter to be 
discriminated from fusion in interrupted noise the ‘off period of the 
sound/time fraction must be sufficiently long to allow the auditory 
sensation to decay one detectable step or j.n.d. Thus, if signal intensity 
is constant, for any value of the sound/time fraction the interruption 
rate of a burst of noise at which flutter is observed (i.e. the flutter 
threshold) will depend entirely on the rate of decay of the auditory 
sensation. Individual differences in decay rate should, therefore, be 
paralleled in differential flutter thresholds. Since it is argued here that 
the rate of decay of the auditory sensation during the ‘off’ period of the 
sound/time fraction is a function of disinhibition, then extraverts, dis- 
sipating inhibition slowly, would be expected to have lower flutter- 
fusion thresholds than introverts, who dissipate inhibition quickly. 

In the present study it was, therefore, predicted that hysterics would 
show significantly lower flutter-fusion thresholds than dysthymics, with 
normals falling in an intermediate position. It was also expected that 
there would be negative correlations within the normal group between 
extraversion and the threshold value. UN WS 

The test used was the University of Chicago Sound Discrimination 
Test, details of which are contained in the manual accompanying the 
test. Briefly, it consists of a recorded tape of interrupted *white noise? 
having square wave characteristics. The tape was played to the subject 
through earphones at an intensity of 70 decibels, the volume of the tape- 
recorder being adjusted to this intensity with the aid of a calibrated 
meter, The repetition rates used varied from 10 to 150 i.p.s. in steps of 
10i.p.s., and the sound/time fraction had a constant value of 0:90. Four 
Series were given, within each of which the fifteen frequencies were 
paired at random with bursts of continuous noise. The burst duration 
for continuous and interrupted noise was two seconds and the intra- 
Pair delay one second. The subject was simply asked to indicate after 
each pair whether the two sounds were the same or different. The 
threshold for flutter fusion which was derived from the total responses 
Constituted the subject’s score on the test, 
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The results for this test are shown in Table 2.12. Contrary to expecta- 
tion, the differences between the four groups were not significant on 
analysis of variance. Indeed, the difference between the hysteric and 


TABLE 2.12 
AFF score i 
Mean S.D. 
Normals 76:56 23-241 
Dysthymics 65:31 20:354 
Hysterics 65-00 15811 
Schizophrenics 60-31 21:691 


dysthymic groups was negligible, both having lower AFF scores than 
the normal group. The correlation between E score and AFF threshold 
in the normal group was also insignificant, r being —0:09. The most 
noticeable feature of the results is that the test seems to discriminate not 
so much between introverts and extraverts as between the normal and 
psychiatric groups. This seems to parallel findings for visual perception 
(e.g. Krugman, 1947) which have indicated that the CFF is also lowered 
in psychiatric patients compared to normal subjects. While we shall 
have occasion to refer briefly to this point again, it must, of course, be 
concluded here that none of our predictions relating to the AFF test was 
confirmed. Consequently, it may be said that there is no evidence from 
the present results, at least, that individual differences in flutter threshold 
can be related in a predictable manner to extraversion, although findings 
more in line with prediction are reported elsewhere in this book by 
Holland. 

Picture Recognition Test. This is a test devised by Brengelmann (1958), 
whose application of it to normal, neurotic, and psychotic subjects 
indicated that the overall difference in the scores of these groups was 
highly significant, normals being clearly differentiated from the two 
patient groups. The latter did not differ markedly from each other and 
a simultaneous test of Eysenck's hypothesis revealed that hysterics and 
dysthymics, and extraverts and introverts, did not differ in their scores 
on the test. The test itself consists of a box illuminated inside and having 
a variable speed photographic shutter fixed at one end. A picture is 
inserted in the box at the end opposite to the shutter and the subject 
is required to view through the shutter aperture for various exposure 
times and to report what he sees. A maximum of eleven exposures for 
each picture is given, these being arranged as follows: exposures 1 and 
2 are given at a shutter speed of qoy Sec., 3 and 4 at $ sec., 5 and 6 at 
4 sec., 7 and 8 at 1 sec., 9 and 10 at 3 secs., and 11 at 30 secs. The 
subject's score on cach picture is the number of the exposure at which 
he correctly recognizes the picture, unless he fails completely when he 
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is given a score of 12. The total test score is the sum of the individual 
scores from each of the six pictures. 

The mean score for each group on this test is shown in Table 2.13. 
Analysis of variance indicated that the overall differences between the 
groups were not significant, although it can be seen from the table that 
the group means are in the expected direction, the three patient groups 
showing generally longer recognition times than normal. 


TABLE 2.13 
Picture Recognition Test 
Mean S.D. 
Normals 39:31 7:132 
Dysthymics 40:38 3:931 
Hysterics 44:75 7-084 
Schizophrenics 40:81 9-726 


Body Sway Test. This is a test of primary suggestibility, shown by 
Eysenck (1947, 1953b), and ona number of other occasions subsequently, 
to be closely associated with neuroticism but not with extraversion. The 
particular apparatus used here consisted of a simple pulley arrangement 
which allowed a measure of forward and backward sway to be taken on 
a vertical scale graduated in inches. The suggestion that he was falling 
forward was made to the subject by means of a tape-recording of the 
experimenter’s voice, the suggestion being continued for two minutes. 
Both total backward and forward sway were noted, subjects falling 
completely in either direction being given a sway measure for that 
direction of six inches. Total sway regardless of direction was also 
calculated for each subject. Mean sway for each of the four groups is 
shown in Table 2.14. As expected, the two neurotic groups were alike 


TABLE 2.14 
Body Sway: total sway 
Mean S.D. 
Normals 2:05 1:319 
Dysthymics 277 1:833 
Hysterics 3:08 2-169 
Schizophrenics 2:34 2:056 


in Showing the greatest amount of body sway, with normals showing 
least sway. Analysis of variance revealed, however, that these differences 
Were not significant. T 
Sorting Test. Here the subject is asked to sort a number of objects in 
as many ways as possible and to give adequate explanations for each 
Sorting method. Tests along these lines have been used for a number of 
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years in the study of ‘abstract’ thinking (Vigotsky, 1934; Goldstein and 
Scheerer, 1941) and the general finding of impairment of concept 
formation in schizophrenics is too well known to require elaboration 
here. A thorough examination of the problem has been made by Payne 
(1954), whose sorting test was used in the present study and who gives 
a detailed description of its use in the investigation of concept formation. 
Briefly, the material used consisted of twelve objects of common geo- 
metrical shape, viz. four square, four circular, four triangular. All the 
objects were different, but could be grouped together in a variety of 
ways according to a number of characteristics. These included, in 
addition to shape, such qualities as size, thickness, material, weight, and 
colour (both hue and brightness). After a preliminary demonstration 
if the subject did not grasp the instructions, he was asked to sort the 
objects in as many ways as possible, giving a reason each time for the 
sorting method used. Two scores were taken on the test, viz. the number 
of correct sortings and the total time taken, the latter being converted 
to a log. time value. 

The group mean values for these two scores are given, respectively, 
in Tables 2.15 and 2.16. It can be seen that schizophrenics are, as pre- 
dicted, poorest on both measures derived from this test, showing fewer 
correct sortings and a longer time than the other three groups. The 


TABLE 2.15 
Sorting Test: number of correct sortings 
Mean S.D. 
Normals 425 1:479 
Dysthymics 3:56 1:972 
Hysterics 3:50 3:969 
Schizophrenics 2:56 2:123 
TABLE 2.16 
Sorting Test: log. time 
Mean S.D. 
Normals 2:5094 0:3059 
Dysthymics 2:5032 0:2557 
Hysterics 2:5745 0:2464 
Schizophrenics 2:6701 0:2429 


differences between the groups were not, however, statistically significant 
on analysis of variance. 


DISCUSSION OF GROUP DIFFERENCES 


Neurotic and normal groups. From the overall group differences a 
number of consistent findings emerge which essentially confirm most 
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of the hypotheses proposed. It is clear that, with the exception of a few 
measures, hysterics will differ from dysthymics in a systematic and 
predictable fashion and that, again with a few exceptions, normals will 
tend to fall midway between these two groups. Hysterics are found to 
have poor vigilance, a relatively short spiral after-effect, large negative 
time errors in the judgement of sound intensity, and a tendency to under- 
estimate time intervals to a greater extent than dysthymics. Further, in 
so far as the hysteric group emerged significantly more extraverted than 
the dysthymic group the theoretical position adopted at the beginning 
of this paper seems to be substantiated. Some discrepancy arises, how- 
ever, when comparisons are made between the normal group and the 
two neurotic groups, particularly between hysterics and normals. The 
attentive reader will recall the finding, reported earlier, that the hysteric 
group did not emerge more, but slightly less, extraverted than the 
normal group. This, of course, was not predicted. Taking this into 
account, it is apparent that the hysterics’ performance is somewhat 
exaggerated, out of proportion as it were, to the degree of extraversion 
Present in this group. A somewhat similar effect, but in reverse, was seen 
in the results of the vigilance test, where dysthymics and normals of 
equivalent degrees of introversion differed markedly in their fall-off on 
this test. Since the inhibition hypothesis proposed as a general explana- 
tion for the results seems to be fairly substantially confirmed, it is 
m necessary to account for these disproportionate inhibitory 
eitects, 

One possibility is that motivational differences of some kind were 
operating to produce these effects. A source of such differences between 
the hysteric and dysthymic groups, at any rate, might conceivably lie 
in the N-scale of the MPI, assuming neuroticism as measured by this 
Scale to act as a drive. Sigal, Star, and Franks (19582), for example, have 
reported that hysterics tend to have lower N-scale scores than dysthy- 
mics. In the present sample the difference between the two neurotic 
groups in N-score was certainly in this direction but was not significant. 
Inspection, however, of responses to individual items of the N-scale did 
Teveal some differences between the two groups. It was found, for 
example, that to the question *Would you rate yourself as a tense or 
high-strung individual?’ out of sixteen subjects in each group twelve 
dysthymics, but only four hysterics, answered in the affirmative. Chi- 
Square calculated between these frequencies had a value of 8-56 (d.f. 2), 
Which is easily significant at the 0-02 level of confidence. While this 
Particular finding is in itself very slender evidence it is perhaps not 
Lua asenta] that it tends to confirm the clinical differences normally 
ie to distinguish these two groups of neurotics. Traditionally, the true 
ad has been recognized as lacking in drive, or more specifically 
thi Ing in overt anxiety or tension; the ‘belle indifference accompanying 

I5 is well known. By contrast, the dysthymic is characterized by high 
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levels of ‘free’ anxiety and tension. Although no evidence is forth- 
coming in the present study of the likely source of such differences, if 
they exist, in the sample used here, the work of van der Merwe (1948) 
has demonstrated some of the autonomic concomitants of what appears 
to be the same underlying variable, hysterics and dysthymics being 
shown by him to fall on either side of a normal group on a measure of 
emotional tension. It seems not unlikely, therefore, that differences in 
what we may justifiably subsume under the heading of ‘drive’ may have 
accounted partly for the inter-group differences. Further support for 
this will be presented later, but meanwhile attention must be turned to 
the probable effects of drive on the experimental findings. 

It is quite apparent that the effect of drive is likely to be particularly 
important on tests such as vigilance and the pursuit rotor. The asymp- 
tote of performance reached on the latter, for example, is clearly 
determined to a large extent by drive and it is noticeable in this respect 
that hysterics performed at a markedly lower level than other groups 
on the pursuit rotor. Similarly, the low reminiscence scores of hysterics 
may be accounted for if we assume a lowered degree of drive in this 
group, since, as Eysenck (1956b) has pointed out, reminiscence is closely 
related to drive, because the lowered ‘critical level’ associated with it 
limits the individual’s tolerance of inhibition and thus allows less to 
build up. One other consequence of changes in the critical level is that 
rest pauses during performance are likely to occur earlier and more 
frequently. The reader may be briefly reminded here of the process in- 
volved. The onset of reactive inhibition is expected to culminate in a 
cessation of behaviour when Ip equals the level of positive drive, that 
is at the critical level. Through the reduction of negative drive (Ip) the 
rest pause becomes a conditioned habit of not responding (sIg). The 
possible analysis of vigilance in these terms has already been discussed 
in considering this test and Broadbent (1958) has recently indicated the 
importance of rest pauses in determining vigilance performance. The 
poor vigilance performance of hysterics and the superior performance 
of dysthymics relative to normals of equivalent introversion would 
certainly be consistent with such an analysis. 

An alternative possibility is that increases and decreases in drive 
result not simply in differential tolerance levels of inhibition, but serve 
to alter the individual's susceptibility to it. Such an analysis would 
accord well with a central view of drive, such as that of Hebb (1949), 
particularly the later modifications to his theory (1955), which identifies 
drive as a state of cortical arousal arising from the activity of the brain 
stem reticular formation. This interpretation of drive may be meaning- 
ful in the present context in so far as it would take account of a possible 
interaction between excitatory and inhibitory processes, both occurring 
at a central level. In terms of the present theory it could be thought of 
tentatively as a shift, possibly temporary, in the balance of excitation- 
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inhibition proposed to underly the personality dimension of intro- 
version-extraversion. A heightened state of drive or arousal in the 
dysthymic, for example, could be thought of as effectively a shift towards 
greater excitation or, conversely, lower inhibition. A corresponding 
lowering of the arousal level in hysterics could similarly account for the 
apparently disproportionate inhibitory effects seen in this group. Parti- 
cularly relevant to this farmulation of drive is the suggestion, considered 
above, that the drive differences under discussion may have an auto- 
nomic basis, since the level of arousal is almost certainly determined 
(Hebb, 1955) by the amount of internal as well as external stimulation. 

Clearly, such an interpretation of drive does not essentially contradict 
the analysis of pursuit rotor and vigilance performance discussed above. 
On tasks of this kind drive in the Hullian sense may still be expected to 
have a limiting effect, via the critical level, on the level of inhibition 
reached before rest pauses begin. The opportunity, during such enforced 
Test pauses, partially to dissipate previously acquired inhibition may be 
expected to have a ‘levelling’ effect on measures such as reminiscence, 
in the manner discussed earlier. A further factor complicating the 
reminiscence measure is the role played by habit. As we shall see when 
the pursuit rotor performance of psychotics is discussed, the reminis- 
cence measure is, in fact, a multiplicative function of both habit 
Strength (Hg) and inhibitory potential (Ir). Any reduction in habit 
level, as the pursuit rotor curves of hysterics suggest may be the case 
in this group, will necessarily lower the actual reminiscence changes 
found, even though the inhibitory component has a relatively high value. 
(The reader is referred, for a fuller discussion of this point, to Treadwell, 
elsewhere in this book.) Indeed, it is possible that reminiscence scores 
will Prove unsuitable as direct measures of inhibition, at least where 
variations in drive and/or habit are expected. 

Despite the lack of certainty in the results and the consequent need 
for further investigation, it seems not unreasonable on the evidence 
available to conclude that differences in drive or ‘arousal’ level in neuro- 
tics may serve to exaggerate the effects of inhibition on performance 
Which are predictable in normal subjects from extraversion alone. The 
Possibility that an actual shift occurs in the excitation-inhibition balance, 
while according well with modern views of drive as a central process, can 
only be regarded as tentative. Nevertheless, it would seem important 
theoretically to determine exactly the extent to which the predicted 
Telationship between extraversion and inhibition is altered by such 

actors as drive once we go beyond the middle ranges of normality. In 

Order to clarify some of the issues emerging from the present findings 

further work is necessary, particularly in establishing more exactly the 

manner in which possible drive differences in hysterics and dysthymics 

may be paralleled by changes on autonomic and other measures. Sup- 

P “menting this, direct comparisons, on tests of the kind used here, 
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between normals and neurotics of equivalent levels of extraversion and 
introversion would also be expected to throw further light on the 
problem. Unfortunately, in the present study the numbers were too 
small to make such comparisons with any degree of confidence and this 
must be left to future research. 

Psychotic group. At the beginning of this paper reference was made 
to two groups of variables which have been commonly thought in the 
past to differentiate the schizophrenic from normal. There is the possi- 
bility, it will be recalled, that the schizophrenic is characterized by 
excessive amounts of inhibition, or alternatively, that a lack of drive, 
or ability to sustain attention, is largely responsible for his poor per- 
formance on most tasks. Our own hypotheses directed at this group 
were, it must be admitted, formulated on the basis of an intuitive com- 
promise between the two approaches. It will be apparent to the reader 
by now that the results for the psychotic group do not emerge un- 
equivocally in support of either. In so far as the schizophrenics' 
performance on some tests is similar to that of hysterics, whom we have 
already held to be characterized by higher levels of inhibition than 
dysthymics, it can certainly be said that such results support the 
suggestion that high levels of inhibition also characterize the schizo- 
phrenic. The time error, time judgement, and vigilance tests, for example, 
could readily be analysed in these terms, although the results of the 
latter are also consistent with an explanation in terms of decreased 
motivation, or, again, as an interaction between drive and inhibition. 
Most critical, however, to an explanation in terms of excess inhibition 
alone are the low pursuit rotor reminiscence scores of schizophrenics 
and the finding that their spiral after-effect is, if anything, longer than 
normal. It is certainly possible that the reminiscence finding could be 
explained if we assume a lowered critical level due to lowered drive in 
this group. Such an explanation would then neither contradict nor con- 
firm an analysis in terms of increased inhibition. However, the spiral 
results still present a difficulty. 

Brief mention has already been made, however, of a suggestion by 
Venables (1959) that it is the rate of dissipation, rather than the 
absolute level, of inhibition which is important in the performance of 
schizophrenics. Such a concept has a number of advantages not shared 
by the two alternatives considered above and appears to merit further 
consideration. It is clear, for instance, that if it is in fact the rate of dis- 
sipation of inhibition which is critical, then the spiral result is no longer 
unusual, since the length of after-effect is uninfluenced by the speed at 
which the inhibition accounting for the phenomenon decays. It is in fact 
only the degree of inhibition induced which is critical, this inhibition 
dissipating in its own time, as it were, after the test has finished. The 
concept of differential rates of inhibitory decay has recently been applied 
to the pursuit rotor by Broadhurst and Broadhurst (unpublished paper) 
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who report similar findings with regard to reminiscence in schizo- 
Phrenies as those found here. Their suggestion is that the reason 
schizophrenics show little reminiscence is because the ten minutes rest 
allowed between phases is insufficient to allow inhibition in psychotics 
to dissipate. There are, however, a number of difficulties with this 
explanation, as the Broadhursts themselves point out. The main diffi- 
culty concerns the effects of conditioned inhibition. Since the extinction 
of conditioned inhibition (sIr) is dependent essentially on the fact that 
reactive inhibition (Ip) has dissipated during the ten-minute rest and 
can no longer reinforce the habit of not responding (sIr), then schizo- 
phrenics, not having dissipated their inhibition, should not show the 
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Phrenics in Phase I and II. Notable Features, discussed in the text, are the Straight 
Line Learning Curve and Exaggerated ‘Warm-Up’ shown by Schizophrenics. 


effects in the post-rest learning curve suggestive of the extinction of this 
habit. In other words, the abrupt rise in the post-rest learning curve, 
sald by Eysenck (1956c) to be the result of the extinction of conditioned 
qhibition, should not occur in schizophrenics. However, as can be seen 
a Fig. 2.6, there are in fact signs of an exaggerated, rather than a 
Iminished, extinction increment in the learning curve (Phase II) of the 
Psy chotics used here, although the Broadhursts’ results are somewhat 
Issimilar, 
, Â reconciliation of the two apparently contradictory findings of the 
igh sly and low reminiscence is possible, however, if we assume that 
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the time intervals for inhibitory decay are shorter than that implied by 
the Broadhursts—i.e. greater than ten minutes—but still longer than 
that required by normals. It is clear, for instance, that if schizophrenics 
dissipate inhibition at a slower rate than normals then one consequence 
of this will be that the involuntary rest pauses occurring when Ir 
reaches the critical level will be somewhat prolonged, since a greater 
time will elapse between the onset of the rest pause and the resumption 
of practice following the decay of inhibition. Theoretically there would 
be two consequences of this. First, the amount of conditioned inhibition 
subtracting from the habit factor will be proportionally greater at any 
one point in learning than in normals. Secondly, the breaks in perform- 
ance, if they are long enough, will reduce the amount of practice taken 
and thus retard the growth of habit strength. Both of these, and parti- 
cularly the first, are likely to delay improvement in Phase I and may be 
critical especially at the beginning of this phase when sHg values are 
low. This may incidentally account for the rather delayed learning 
curve shown by schizophrenics (see Fig. 2.6), although an equally mean- 
ingful explanation of this has been given by O'Connor and Rawnsley 
(1959), who suggest that the ‘straight line’ learning curves shown by 
some psychotics reflects the progressive arousal of drive in these sub- 
jects. Whatever the correct explanation of this aspect of the results, the 
position in any case would seem to be that at the end of Phase I schizo- 
phrenics are likely to have more slp than normal. Following the rest 
they would then be expected to show a large extinction increment. 
Turning to the reminiscence effect the reader is referred to the follow- 
ing modified equation, suggested by Gwynne Jones (1958) as a more 
logical representation than that presented by Hull of the components 
determining effective reaction potential or performance level: 


sËr = (D x sHg) — (D X sir) — (In X sHr) + (In X sIr) 


On the final trial of Phase I performance is determined by all four com- 
ponents in this equation. On the first trial of Phase II it is determined 
by only the first two, the components falling out—because Ig has de- 
cayed in the normal way after ten minutes—being (a) the negative 
component (Ip x sHg), this being the component that gives rise to 
the usual reminiscence effect; and (b) the positive component (Ir X sIr), 
this lowering the reminiscence effect according to its magnitude. It is 
clear that the removal of this second component will, together with the 
subtracting influence of the still remaining negativecomponent(D x sIr), 
have proportionally greater effects in schizophrenics than in normals, 
if, as we have suggested sIg has a higher value in the psychotics. The 
relatively good performance of the psychotic group would not in fact be 
revealed until this considerable amount of conditioned inhibition had 
been extinguished, and the reminiscence increment would be abnormally 
low. One other point should be made. From Fig. 2.6 it can be seen that 
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in fact no reminiscence occurs if we take the last and first trials of Phases 
I and II as our reference points, which, it will be remembered, was not 
done in taking the reminiscence measures here. (In fact, of course, the 
performance measure taken on trial 1 of the second phase is also affected 
to some extent by the necessity for the subject to warm up, suggested by 
Eysenck (1956c) to last about 10-20 seconds. This will subtract from the 
‘true’ reminiscence effect although in normal subjects it can be regarded 
as of little significance when compared with the magnitude of the reminis- 
cence changes usually found. In psychotics, however, if our previous 
analysis is correct, the subtraction of such a component is likely to 
result in a zero or even negative change in performance following a 
Test, particularly if, as seems likely, schizophrenics are slower than 
normal in warming up.) 

While the interpretation given here is necessarily post hoc it does have 
the merit of suggesting further testable hypotheses and does, further- 
more, account for all the results more consistently than the other possi- 
bilities considered. In vigilance, for example, a lower detection rate of 
signals would be the expected result of slow inhibitory decay. Indeed, 
any prolongation of the time spent away from tests of this type may be 
regarded as particularly critical, since the probability of signal detection 
during these periods is reduced to zero. A higher growth rate or absolute 
level of inhibition may, in fact, be less critical since the inhibition 
accumulating during vigilance tasks arises largely as the result of stimuli 
(in our case digits) which are irrelevant and contribute in no way to the 
Subject’s score. This may account for the slight inferiority in vigilance 
of schizophrenics compared with hysterics, in whom inhibitory growth 
Tate rather than decay rate has been more frequently emphasized. It 
is certainly true that to a greater extent on some tests than others the 
three aspects of the inhibitory process—absolute level, growth rate, and 
Speed of decay—may appear to have effectively identical or similar 
effects on measures of performance. This is exemplified by the greater 
tendency generally for the performance of schizophrenics to resemble 
that of hysterics rather than dysthymics, the notable exception to this 
being the spiral after-effect measure. By means of suitably constructed 
tests and experimental designs aimed at testing specific hypotheses it 
Should, however, be possible in future research to isolate the relevant 
aspects of inhibition which differentiate the schizophrenic from the 
hysteric and from other normal and abnormal personalities. 


THE OVERALL ANALYSIS OF RESULTS 


E ir We have been concerned with the analysis and discussion of 

t ividual tests in terms of group differences in behaviour. In this form 

: ita. provided, with a number of provisos, fairly substantial con- 

firmation of the theoretical framework within which the original 
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hypotheses were formulated. Now, however, we must turn to a con- 
sideration of the relationships between tests and attempt an analysis of 


TABLE 2.17 


Key to measures included in First and Second Analyses 


Test 


Measure and description 


SSG (intelligence) rating Sign reflected in final factor loadings. 


Vigilance 
Vigilance 


Vigilance 


Vigilance 
Vigilance 


Time Error 

Sorting Test 
Sorting Test 

Picture Recognition 
Body Sway Test 
Time Judgement 


Spiral 

AFF 

Pursuit rotor 
Pursuit rotor 
Pursuit rotor 


Pursuit rotor 
Pursuit rotor 


Pursuit rotor 


Pursuit rotor 
Pursuit rotor 
Pursuit rotor 
Pursuit rotor 
MMPI 
MMPI 
MMPI 
MMPI 

MPI 

MPI 


First part: total score. 

First part: regression coefficient ‘b’ or 
‘starting level’, 

First part: regression coefficient ‘a’ or 
‘improvement’. Sign reflected in final 
loadings. 

Second part: total score. 

Performance change from first to second 
part of test. 

Mean of error at four S-V intervals. 

Number of correct sortings. 

Log. Time. 


Total sway in both directions. 

Mean error of underestimation for four 
time intervals. 

Mean of four trials, 


RSI. 

RS2. 

% RSI or reminiscence change as a per- 
centage of pre-rest performance level. 

% RS2. 

Phase I: regression coefficient ‘b’ or 
‘performance level’. 

Phase I: regression coefficient ‘a’ or ‘rate 
of improvement’. 

Phase II: regression coefficient ‘b’. 

Phase II: regression coefficient ‘a’. 

Phase III: regression coefficient ‘b’. 

Phase III: regression coefficient ‘a’. 

HY Scale score. 

PT Scale score. 

PD Scale score. 

SC Scale score. 

E-scale score. 

N-scale score. 


the main factors underlying them. Since the principal theme running 
through this investigation has been that individua] differences on the 
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wide variety of measures employed are explicable in terms of variations 
| in underlying inhibition, then it would be expected that an ‘inhibitory 
| factor’ would emerge from the intercorrelations between the tests and, 
further, that this would be identifiable with the behavioural dimension 


TABLE 2.18 
Unrotated factor loadings for First Analysis (48 subjects) 
Test Factor 
I IL HI IV 

0 —0:49 —0:35 —0441 —0-06 

1 0:34 0:48 021 0-64 

2 0-29 0:38 0:74 0-06 

3 —0:05 —0:03 —0:72 0-50 

4 0-63 0-14 0-19 0:18 

5 024 —0:31 0:41 —0:56 

6 0-13 0:36 -002  -002 

7 0-28 0-26 016  —O13 

8 —0-08 —0-01 014 001 

9 —021 —0:06 —002 . —025 

10 —0:36 —0:05 0:04 —0:07 

11 0-45 0-22 0:28 —0:14 

12 —0:07 0:54 —044 0:09 

| 13 0:46 0-04 0-11 —0:33 
14 0:43 0-01 —0:38 —0:53 

15 0-42 —0:18 —0:25 —022 

16 0-07 —0:26 —0:26 —0:32 

17 —0:18 0:04 0-41 —0:08 

18 044 0:43 —0:10 0:32 

19 0-12 —0:01 —004  —021 

| 20 0:73 0:43 —0-23 0:03 
| 21 —0:56 —0:32 0:03 0-09 
22 0:73 0-16 —0:47 —0:04 

23 —0:55 0:03 042 0:34 

24 —031 0-67 006 —018 

25 —0:46 0-74 —0-08 —028 

26 —0:23 0:62 0:07 —0:37 

27 —0:48 0:65 —0:06 —0:38 

28 0:38 —0:56 0:07 —0-22 

29 —033 049  -—038 016 


pa MAU gc Of particular interest in this respect was the possibity 

differenti} a link between inhibition and satiation according to the 

n Nh al method of proof discussed earlier. 

carried el to investigate this, two principal comp 
1Th " on the data, correlations being calculate 

carried (itt is indebted to Mr. Benedict Nixon of the Psychology Department who 

Out these analyses for him on the Ferranti Pegasus Electronic Computer. 
143 


onent analyses were 
d between thirty 


Gordon Claridge 


selected measures. These measures are indicated in Table 2.17, num- 
bered to correspond with the entries in the tables of factor loadings, and 
briefly described where appropriate. The first analysis was carried out 
on the scores for the hysteric, dysthymic, and normal groups only, there 


TABLE 2.19 
Rotated factor loadings for First Analysis (48 subjects) 
Test Factor 
I Il ni* IV* h? 
Ot 0:73 0:0 0:0 —0:07 0-538 
1 0:57 —0:06 0:13 —0:67 0-794 
2 0:79 0-12 —0:36 —0312 0782 
3t 0:45 0:20 —0:56 0.46 0768 
4 0-60 —0:28 0:18 —0:07 0:476 
5 0:25 —0:13 —0:32 0:67 0-631 
6 0-25 0-22 0-19 —0:07 0:152 
7 0:40 —0-12 0-08 0-09 0-189 
8 0:02 0:04 —0-15 0-0 0-024 
9 0-19 0-16 —0-10 020 0112 
10 —0:25 0-17 —0-22 0-0 0-140 
11 0:57 0-0 0-05 015 0:350 
12 —0:05 0:41 0:50 —0:28 0-500 
13 0:40 —0:07 0-14 0:39 0:337 
14 0:09 0-0 0:52 0-58 0:615 
15 0:07 —0:26 0:32 0:36 0:304 
16 —0:21 —011 0-17 0-40 0245 
17 0-12 0-15 —041 002 0205 
18 0:44 —0:02 0:42 —0:34 — 0:486 
19 0-05 0:02 0-08 022 0-058 
20 0-58 0:05 0-67 —0:01 0:788 
21 —0:51 0-0 —0:38 —0:10 0:414 
22 0:32 —0:22 0-78 014 0779 
23 —0:13 0:15 —0:59 —044 0-581 
24 0-12 0:74 0:0 —0:13 0:579 
25 —0:03 0:91 0-08 —0:10 . 0:845 
26 0-16 0-74 0:02 0:07 0:578 
27 —0:07 0:89 0:02 002 0-798 
28 0-05 —0:53 —0:05 0-48 0:516 
29 —0:22 0-60 0:31 —0-09 0:513- 


* Indicates that these factors have been reflected (see text). 
T Indicates that these tests have been reflected (see text). 


being 48 subjects in this case. The same measures from all 64 subjects 

were then included in a second analysis. Four components were 

extracted from the correlation matrix of each analysis. The elements of 

the latent vectors appropriate to each component are considered here 

in their factor-loading form, the loadings for the first and second 
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analyses being shown in Tables 2.18 and 2.20 respectively. In order to 
achieve psychological meaning from the components orthogonal rota- 
tions were performed on both sets of data. The rotated factor loadings 
for the analysis carried out on 48 subjects are shown in Table 2.19 and 
the corresponding values for the second analysis in Table 2.21. It should 
be noted that in Table 2.19 the signs of all loadings for Factors HI and IV 
have been reversed to. facilitate interpretation and the comparison 
between the two analyses. Similarly, in both Tables2.19 and 2.21 the signs 
for tests 0 and 3 have been reflected, again to facilitate interpretation. 
Test 0 (the SSG or intelligence rating) was, it may be remembered, 
Scored in such a way that a high SSG referred to low intelligence level. 
In the case of Test 3 the regression coefficient ‘a’ is more appropriately 
Considered in the context of the vigilance test as a measure of rate of 
fall-off, rather than of rate of improvement. The analysis carried out on 
the normal and neurotic groups only will be considered first and the 
reader is referred again to Table 2.19. 
, The first factor extracted can fairly clearly be identified as a factor of 
Intelligence, the loading of the intelligence measure itself (Test 0) being 
0773, and the Sorting test (Test 7) having, as might be expected, a loading 
of 0:40. Although this factor is not in itself of immediate interest it 
should be noted that a number of other measures, not so obviously 
Associated with intelligence, also have relatively high loadings on this 
factor. Particularly noticeable in this respect are some of the vigilance 
measures (Tests 1-5 inclusive). Vigilance starting-level (Test 2), for 
instance, has a higher loading on this factor than any other test, viz. 
9-79. One possible explanation of this is that within the relatively wide 
Tange of intelligence covered here the inability of the duller subjects to 
grasp immediately the concept of odd versus even numbers was picked 
is tin their poor performance at the beginning of the test. It is possible, 
erefore, that intelligence influences vigilance performance only up to 
certain level of brightness but not beyond it. The Time Judgement test 
e 11) can also be seen to have a moderately high loading of 0:57 
Ue ae intelligence factor and zero or negligible loadings on the other 
ne factors, Equally disappointing is the Time Error test (Test 6), the 
;ghest loading of which is 0-25, again on intelligence. Auditory Flutter 
ig (Test 13) seems also to be partly determined, in this sample 
ma nur intelligence, its highest loading (0:40) being on this factor. t 
t w^ © significant that the last three tests have as an element 1n = 
hich i that they demand from the individual a degree of jadgemes 1 
inn » Intuitively at least, one might expect to be subject to Pt 1 d 
oa eh Speaking generally, it may be true that some of the hig) 
ue ings on Factor I of tests not obviously related to intelligence ar, 
su .1n part to the range of ability present in the sample, a pos E 
ps pels being somewhat duller than those normally employed in 
Ychological experiments. 
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Factor II can also be interpreted fairly clearly. It is mainly determined 
by the questionnaire measures from the MMPI and MPI (Tests 24-9 
inclusive) and is similar to that reported, for example, by Eysenck, 
Granger, and Brengelmann (1957), who called their factor ‘intro- 
spective abnormality’. In the present case it may be more specifically 
identified as ‘neuroticism’, most of the tests with high loadings on the 
factor being measures of this dimension. Few of the objective tests have 
appreciable loadings on the factor, although the Spiral test (Test 12) 
with a loading of 0:41, is a noticeable exception. This test can also be 
seen to have loadings on the third and fourth factors and would seem, 
therefore, in view of our predictions about the test, to merit further dis- 
cussion. This will be delayed, however, until the other factors have been 
described. 

Factor IV is most conveniently considered next as this too seems to 
be fairly well defined. In fact, it may be interpreted with some confidence 
as the factor of ‘extraversion’ which we predicted would emerge from 
the analysis, the loading of the E-scale (Test 28) being 0-48. Its identifi- 
cation with an underlying factor of ‘inhibition’ is supported by a 
number of loadings on the objective tests. The loadings of —0:67 and 
0:46 shown by Tests 1 and 3, for example, indicate that poor overall 
vigilance performance and rapid fall-off on this test, both indicative of 
high levels of inhibition, are positively associated with extraversion. 
Similarly Test 5 (Vigilance change) has a loading of 0-67 on this factor 
and confirms our hypothesis that disinhibitory effects will be greater 
among extraverts, although the moderate loading of this measure on 
the third factor should also be noted. Turning to the other tests RSI 
(Test 14) and RS2 (Test 15) have loadings of, respectively, 0-58 and 0:36, 
reminiscence, as predicted, being higher in extraverts, although again 
there are appreciable loadings on Factor III. AFF (Test 13) has a load- 
ing of 0:39 on the factor, but the Time Error and Time Judgement tests, 
with loadings of —0:07 and 0-15 respectively, fail to confirm our expecta- 
tions that the group differences found on these two tests would be 
explicable in terms of the link between inhibition and extraversion. 
Indeed, the very low communality of Time Error (Test 6) indicates the 
very small extent to which the four factors extracted account for per- 
formance on this test. Partial confirmation of the similarity between 
satiation and inhibition is forthcoming in the loading on extraversion 
of the Spiral test, but the value of —0-28 is noticeably lower than the 
same test's loading on other factors, particularly Factor III to which 
we now turn. 

Interpretation of the third factor is less certain than in the case of the 
other three factors. It has loadings of —0:56 on Vigilance Fall-off, 0:52 
and 0-32 on RSI and RS2 respectively, 0-50 on the Spiral, and 0-42, 
0-67, and 0-78 on the performance level measures for the three phases 
of the pursuit rotor experiment. In addition, the N-scale of the MPI 
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(Test 29) has a moderate loading of 0:31. The pattern of loadings 
Suggests that the most likely interpretation is that this is a ‘drive’ or 
motivational’ factor. Undoubtedly supporting this are the high positive 
loadings of pursuit rotor performance level, of which one of the major 
determinants is certainly drive. Similarly, low degrees of fall-off in 
Vigilance may be meaningfully interpreted as being closely associated 
with high drive levels, while, as we have already seen, the latter is also 
likely, other things being equal, to result in increased amounts of 
reminiscence, Finally, the moderate loading of the N-scale on this factor 
lends support to the interpretation, in so far as neuroticism as measured 
by this scale can be said to have drive properties. It is interesting in this 
respect that the MMPI measures determining Factor II, which we have 
interpreted as a factor of general neuroticism, have virtually zero load- 
ings on Factor III. It is just possible that the N-scale taps something 
More specific, such as anxiety. This would certainly link up the generally 
lowered N-scale scores and poorer objective test performance of hyster- 
ics with the suggestion, made earlier, that symptomatic anxiety may be 
9ne of the important determinants of the dysthymic-hysteric differences 
found here, 

The probability that Factor III can be meaningfully interpreted as 
One of drive certainly lends general support to an hypothesis that 
Variations in motivation may have accounted for some of the inter- 
Broup differences reported and it is interesting that a factor of this kind 
shou ld have emerged which confirms the previous interpretation of the 
individual test results, 
is ne relevant aspect of the analysis requires further comment and that 

the result of the Spiral test. It can be seen that Factors III and II 
Provide the highest loadings for this test, its loading on extraversion 
Piu. somewhat lower than our original hypotheses would have led us 

i, CXPect. In other words, in this sample increases and decreases in 


drive may be said to have had the effect of, respectively, lengthening 


and shortening the spiral after-effect over and above the latter's nega- 
anation of this is 


Eu Correlation with extraversion. One possible expl f 

at increased motivation, in terms of greater task-involvement, induces 
ae to fixate more tenaciously, while, conversely, low ved 
will 5 him to indulge in eye and head movements," which, becaus ui 
(H produce frequent shifting of the fixation point, are likely to re: 
itio land 1957) in a reduction of the after-effect. In other bound 


t H 
į Known (Holland 1958) that, within wide limits, the length o 


a A - 5 A 
thie fect Increases directly with the amount of previous stimulation, 


en it is possible that reduced drive lowers the after-effect simply by 


on, is that these eye 


ntaneous rest pauses 


1 
An j : NEM 
movem, interesting speculation, worthy of further investigati 
le played by 


nas are th f the spo 
Occurrin, ; he parallel at the perceptual level of the sp 
drive ine In continuous work. If edm i exact determination of the ro! 
Perceptual tasks would be of particular importance. 
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lowering the degree of stimulation giving rise to it. On the other hand, 
it is likely that particularly gross and frequent movements, which remain 
to be demonstrated, would be necessary to account for the magnitude 
of the effect. 

Another possibility is that the loading on Factor III simply confirms 


TABLE 2.20 , 
Unrotated factor loadings for Second Analysis (64 subjects) 
Test Factor 
I Il Til IV 
0 -0:66 -0:25 0:33 -0:03 
1 0-71 0:29 -0:15 -0:39 
2 0:70 0:26 -0:35 0:02 
3 -0:18 -0:03 0:37 -0:57 
4 0-75 0-10 -0-27 -0-06 
5 0-12 0:20 —0:49 0-64 
6 0:42 0-15 0:07 -0:21 
7 0-54 0-25 -0-18 0:13 
8 -0:13 0:04 -0:41 0-13 
9 -0:30 0-05 -0:02 0-11 
10 -0:19 0:13 0-12 -0:12 
11 0:35 -0:08 0-04 0-0 
12 -0:03 0:30 0:35 -0:37 
13 0-48 -0:07 0-06 0-22 
14 0:24 -0:19 0-58 0:44 
15 0:37 -0-29 0:45 0-16 
16 -0:17 -0:27 0:36 0:34 
17 -0-15 0-01 0-03 0-26 
18 0:53 0:25 0:0 -0-07 
19 0-18 0-02 -0:37 0-03 
20 0:72 0:17 0-12 0-09 
21 -0:60 -0:20 -0:36 -0:20 
22 0:70 -0:14 0:32 0-06 
23 -0-56 0-18 -0:56 0:18 
24 -0.05 0:74 0:10 0:13 
25 -0:28 0:86 0-10 0-21 
26 -0:20 0:62 0-05 0:44 
27 -0:42 0-75 0-11 0:30 
28 -0-11 -0:57 -0:13 0:39 
29 -0-19 0:57 0:24 0-03 


the suggestion made earlier that drive, in so far as it may be identified 
as a central excitatory process, would be expected to alter in some un- 
specified way the balance between excitation and inhibition which we 
have assumed to underlie the spiral after-effect phenomenon. It is, how- 
ever, impossible on the present evidence to decide which of the various 
possible explanations is correct. 
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The second analysis carried out on all 64 subjects reveals, as can be 
seen from Tables 2.20 and 2.21, a very similar factor pattern, although 
there are a number of discrepancies worth noting. All four factors, with 
the possible exception of Factor III emerge equally clear-cut as in the 


TABLE 2.21 

Rotated factor loadings for Second Analysis (64 subjects) 

Test Factor 

I II I IV h? 
0* 0-77 0-04 0-0 00 0594 
1 0-75 —0:14 0-01 —042 0759 
2 0-83 0-04 0-0 00 0690 
3* 0:32 026 | —002 0-56 — 0484 
4 0-78 —0415 007  -—001 04636 
5 0:25 012 . —010 0-78 0:695 
6 0:38 —0:10 0-15 —028 0:255 
7 0-61 0-13 011 010 0411 
8 0-07 013 — —035 024 0202 
9 —0:24 0-18 0-13 —0:03 0-108 
10 —0:18 010 — —004 . —019 0-080 
11 0:26 —0:16 0-19 0-01 0-129 
12 —0:09 0:07 0-11 —0:57 0:350 
13 0:36 —0:07 0:34 017 0279 
14 —0-09 0-0 0-76 021 0:630 
15 0:05 —0:26 0-61 0-08 — 0449 
16 —0:38 0:0 0:36 026 0342 
17 —045 0:25 0:03 016  O1ll 
18 0:53 0-03 0-18 —0:20 0354 
19 0:32 —0:02 0:26 —006 0174 
20 0:62 0:0 0:43 —0-08 — 0:576 
21 —042 —0-10 —0:46 0.40 — 0:558 
22 0:42 —0:27 0:60 0-02 0-610 
23 —0:20 020 | —078 0-0 0-688 
24 0-12 0-69 005 —030 0583 
25 —0:04 oss  —0:03 028 0855 
26 —0-02 0-78 0-07 003 0-615 
27 —0-20 0-88 —0:04 016 0842 
28 —0-20 —0:25 0:03 0-64 — 0:513 
29 —0:10 0:53 0:09 —034 0-415 


* Indicates that these tests have been reflected (see text). 


first analysis, As before, the intelligence rating (Test 0) has a high load- 

ing of 0-77 on Factor I, again supporting the interpretation that this is 

an ‘intelligence’ factor. As expected, the introduction of the psychotic 

ep has the effect of increasing the loading of the Sorting test (Test 7) 

a first factor from 0:40 to 0:61. The loadings of this test on the 

er factors are, however, virtually unchanged. As before, the vigilance 
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measures, particularly Tests 1, 2, and 4, are prominent in determining 
this factor, the loadings being somewhat higher than in the previous 
analysis. Also somewhat higher are Time Error (Test 6) with a loading 
here of 0:38 and the three measures of pursuit rotor performance level 
(Tests 18, 20, and 22) with loadings of 0-53, 0-62, and 0:42 respectively. 
These high loadings on Factor I are almost certainly due to the fact that 
reduced effective intelligence and poor performance on objective tests, 
such as pursuit rotor and vigilance, are more closely bound up with 
severity of illness in psychotics than is the case in other psychiatric 
groups. 

Factor II can again be interpreted as a ‘questionnaire neuroticism’ 
factor, the loadings of the MMPI scales (Tests 24-7 inclusive) and the 
N-scale (Test 29) being almost identical with those in the first analysis. 
The E-scale (Test 28) loading, however, is reduced from —0:53 to —0:25. 
No objective test has a loading greater than 0-30 on this factor, the 
loading of the Spiral being reduced from 0-41 to virtually zero. It is 
interesting that, contrary to expectations, the Body Sway test (Test 10) 
had low loadings throughout both analyses, the loadings expected on 
Factor II being only 0-17 and 0:10 in the first and second analyses 
respectively. There seems no evidence, therefore, in the present investiga- 
tion for Body sway as a test of general neuroticism. Picture Recognition 
comes off equally badly, the factors in both analyses accounting for only 
a small part of the communalities. 

Factor IIL is less easily identified with the corresponding factor in the 
first analysis, its interpretation as a ‘drive’ component being somewhat 
more tenuous. The previous loadings on this factor of the vigilance 
measures, particularly Vigilance Fall-off (Test 3), are reduced virtually 
to zero, as is the loading of the N-scale. High loadings remaining, how- 
ever, are those of the two reminiscence measures (Tests 14 and 15) with 
values of 0-76 and 0:61 and two of the pursuit rotor level measures 
(Tests 20 and 22) with loadings of 0:43 and 0:60. The loadings of AFF 
(Test 13) is increased from 0-14 to 0:34, and that of Sorting Log. Time 
(Test 8) from 0-08 to —0-35, although both of these are rather difficult 
to interpret. In so far as a number of measures have consistently high 
loadings in both analyses, there seems some justification for retaining 
the interpretation of this factor as one of ‘drive’. It is not unexpected 
that the inclusion of the psychotic group, which accounts for one 
quarter of the sample, should have altered some of the relationships 
between tests. Apart from the sample as a whole being less homogeneous 
than previously, the effect is further exaggerated by the wide differences 
found on most measures within the psychotic group itself. Indeed, it is 
encouraging that the two analyses should have diverged so little, — 

Factor IV can, as before, be identified as one of ‘extraversion’, its 
loading on the E-scale (Test 28) being 0-64, which is somewhat higher 
than previously, although its loading on the N-scale is also increased 
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from near-zero to —0-34. Of the objective tests the vigilance measures, 
Tests, 1, 3, and 5, retain their high loadings on this factor, while the 
Spiral (Test 12) emerges with a more clear-cut relation to extraversion. 
Its loading of —0-57 is over twice as high, while the loadings noted 
previously on the other factors are reduced considerably. The most 
marked discrepancy here is the reduced influence of reminiscence in 
determining the factor,+the previously high loadings of RSI and RS2 
(Tests 14 and 15) decreasing to 0-21 and 0-08 respectively. This finding 
as perhaps not surprising when it is recalled that the overall level of 
reminiscence is reduced in schizophrenics, an effect which is not unlikely 
to mask the relationship of this measure with extraversion. It possibly 
confirms, as suggested earlier, that the similarity between the schizo- 
phrenic’s performance and that of the extravert is to some extent a 
function of the measures we take, while fundamentally two different 
Processes or two different aspects of the inhibitory process are operating. 
To summarize the results of the two analyses, it can be said that, as 
predicted, some of the individual differences found in the group com- 
parisons can fairly confidently be attributed to a personality factor 
identifiable as extraversion. Furthermore, it is true to say that this 
factor bears a systematic relationship to performance on à number of 
Objective tests, in which central inhibition can reasonably be held to be 
a relevant underlying variable. It may be concluded from this, therefore, 
first, that inhibition as an explanatory concept has a general applicability 
to a number of different measures of behaviour; and secondly, that 
extraversion is an important behavioural concomitant of this under- 
lying inhibition. A second important feature of the analysis is the 
evidence forthcoming that differences in what has loosely been called 
drive appear to play some part in determining performance on à number 
of tests. This, it has been argued, arises principally from differences on 
this variable between the two neurotic groups, who form one half of 
the sample. The demonstration of such a factor tends to confirm the 
Possibility that some of the group differences in performance are 
accountable for in terms of an effective shift in neurotics in the balance 
of excitation-inhibition underlying introversion-extraversion. — 
Finally, turning to the rather less positive findings reported in this 
Section, a number of other important facts emerged which tended to 
be masked by the group comparisons. For example, despite very 
starked group differences exactly in the direction predicted, Time a 
Shows little relationship with extraversion, or indeed with any of the 
other factors extracted. Either time error is not mediated by the r 
eed Process in the way we suggested or our results failed to de 
Pe hems for some other reason. Before it would be possible to (€ m 
: inite conclusion it would be necessary to repeat the —— ces 
hee would seem worthwhile in view of the apparent relatio: : 
'p between time error and personality factors. A similar conclusio 
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applies to Time Judgement, where the significant correlation found in 
the normal group between extraversion and time estimation suggests a 
real relationship in need of further investigation. To some extent the 
opposite trend was true of the Auditory Flutter Fusion. Here the differ- 
ence between hysterics and dysthymics was negligible but some relation- 
ship with extraversion emerges from the overall analysis. It is, however, 
in the direction opposite from that predicted and from the results 
gathered on normals by Holland and reported by him elsewhere in this 
book. As we saw in discussing this test, it seems not to discriminate 
between the two types of neurotic but rather between neurotics as a 
group and normals, the latter achieving higher AFF scores. Since the 
neurotic groups taken together are less extraverted than the normal 
group, it is inevitable that positive, rather than negative, correlations 
should occur between AFF score and extraversion. Nevertheless, it is a 
discrepancy requiring an explanation which the present results fail to 
provide. Loadings on intelligence and, in one case, on drive are difficult 
to interpret in a meaningful fashion, although a negative correlation 
with motivational factors is not irreconcilable with the lowering of 
attention which we might expect to be present in some neurotics and 
psychotics and to be accompanied in these subjects by poor discrimina- 
tion on tests of this kind. Whatever the correct interpretation, dis- 
crepancies such as these clearly suggest the need for further investigation 
into what appears to be an important area of personality research. 


SUMMARY AND CONCLUSIONS 


An attempt has been made to examine the validity of Eysenck’s 
‘inhibition hypothesis’ when applied to a variety of tests falling broadly 
into the categories of perception, attention, and learning. The major 
aim of the investigation was to test specific hypotheses arising from 
Eysenck’s theory that groups of dysthymic, hysteric, and normal sub- 
jects would behave on these tests in a manner predictable from their 
respective positions on the personality dimension of introversion- 
extraversion. Comparisons between the behaviour of these groups and 
that of a group of schizophrenics also included in the design were 
expected to suggest the extent to which the behaviour of psychotics was 
explicable within the terms of Eysenck’s theory or suitable extensions of 
it. In addition, an examination of the interrelationships between test 
results gathered on the same population would, it was anticipated, 
indicate the generality of the inhibition hypothesis as an explanatory 
concept for extraversion. 

Results from the tests taken individually confirmed, with very few 
exceptions, the hypotheses proposed in each case, hysterics and dysthy- 
mics behaving in a manner suggestive of, respectively, high and low 
levels of central inhibition. Normal subjects tended to fall between these 
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two groups, while the performance of psychotics was more like that of 
hysterics than that of any other group. Thus, in so far as the various 
hypotheses were confirmed, Eysenck’s theoretical position could be said 
to have been vindicated. However, the relatively low, and unpredicted, 
level of extraversion in the hysteric group, and a number of aspects of 
the performance of dysthymics, indicated what appeared to be an 
exaggeration in the neurotic groups of the inhibitory effects predictable 
alone from the degree of extraversion evident in these subjects. The 
possibility was discussed that drive differences between hysterics and 
dysthymics may have accounted for the effect. The results for the 
psychotic group tended to support the view that a slow rate of inhibitory 
decay may be more characteristic of the schizophrenic than the high 
levels of (protective) inhibition more often invoked in studies of schizo- 
phrenia. Despite some support from other recent studies such a con- 
clusion must be regarded more as a source of further testable hypotheses 
than as a statement of theory. 

In order to investigate the interrelationships between the various 
tests, two principal component analyses were carried out on thirty 
measures derived from the objective tests and from personality question- 
naires. The first analysis utilized the normal and neurotic groups only, 
while all subjects were incorporated in the second analysis. Both agreed 
in demonstrating four meaningful factors, the first two being identifiable 
in each case as ‘intelligence’ and ‘questionnaire neuroticism’. Of the 
remaining two, one could be fairly clearly identified as a factor of 
extraversion'. This was found to be related, as predicted, to a number 
of measures from the objective tests, thus lending further support to the 
generality of Eysenck’s hypothesis. Support was also found for the 
suggestion that motivational influences may have accounted for some 
of the inter-group differences, a factor being present which was identi- 
fiable as one of ‘drive’. 

A number of conclusions may readily be drawn from the results as 
a whole, but perhaps the most interesting, and certainly the most 
important, aspects of the investigation are the questions posed by it 
rather than the answers it provides. It is encouraging, but not un- 
expected in view of previous research, that many of our hypotheses 
should have been confirmed or that further evidence should have been 
found of a link between extraversion and central inhibition. Clearly, 
however, one of the major questions raised by the results concerns the 
Possible influence on performance, and perhaps, therefore, on the 
inhibitory processes underlying performance, of factors other than 
extraversion. By selecting neurotic groups in an attempt to represent 
the extremes of extraversion-introversion it seems likely: that un- 
Wittingly we introduced, as the evidence in fact suggests, other important 
Variables. Of these the most relevant is certainly the drive variable 
isolated in the analysis of results. The implications inherent in this 
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finding seem to have important bearing on the link that has been made 
between extraversion and inhibition, since there seems to be a real 
possibility of an actual or effective shift in inhibition and excitation 
levels in neurotics. Differences at the autonomic level between dysthy- 
mics and hysterics may be meaningful in this context, since a link is then 
feasible between cortical inhibition and drive as a central arousal or 
excitatory process. " 

Probably more questions than answers were forthcoming from the 
results for the psychotic group, although the general similarity between 
hysterics and schizophrenics on many tests seems to be one important 
finding which emerges. While not in itself unexpected, it does, neverthe- 
less, help to delimit the area for future research. It is, of course, the 
differences rather than the similarities between hysterics and psychotics 
which are of greatest importance and the possibility of identical effects 
on performance of what may be different aspects of the inhibitory 
process suggests the need in future studies for carefully planned 
experimental designs and techniques. 

In conclusion, it may be said with some confidence that the results 
of the investigation as a whole undoubtedly bear witness to the need 
for the experimental study of personality to be carried out within a 
theoretical framework which is not only capable of generating testable 
hypotheses, but which is also modifiable in the light of new facts 
emerging from the investigation of these hypotheses. That this has not 
always been the case in personality research in the past is an explana- 
tion, if not a justification, for some of the uncertainties apparent in the 
findings presented here. 


154 


THE EXCITATION-INHIBITION BALANCE 
* IN NORMALS 
(This study was made possible by a grant 


from the Medical Research Council, to 
whom we wish to express our thanks.) 


MEASURES OF LEARNING AND 
CONDITIONING 


R. A. Willett 


GENERAL INTRODUCTION 


THE investigation being reported in this chapter was initially planned 
as an attack on the problem of generality amongst various measures 
obtained from different conditioning procedures. An additional ques- 
tion of central moment was that of how regularly different conditioning 
measures related to the personality parameters introversion/extraver- 
Sion and neuroticism. It will be recalled that a good deal of research 
has been carried out into the problem of how eyeblink conditioning 
relates to these two parameters by Eysenck and by his associates (Franks, 
1956; Franks, 19572; Franks, 19575; Eysenck, 1957). It had been hoped 
to extend that research in this present investigation by utilizing condi- 
tioning techniques other than eyeblink conditioning. In fact, this aim 
has been only partly realized because on closer acquaintance with what 
Were techniques new to the investigators, their status as conditioning 
Processes seems somewhat dubious as will become clear in the later 
discussion of the results. 

The basic conception related introversion/extraversion to a postu- 
lated neurological substruct, the particular balance of inhibition/excita- 
tion tendencies characterizing the central nervous system of a particular 
individual. It was suggested that a state in which inhibitory tendencies 
are apt to predominate characterizes the extravert while the obverse 
characterizes the introvert. Now clearly the most parsimonious corol- 
lary to this hypothesis would allow of only one mode of activity for 
such a substruct. In other words the simplest view of the hypothesis 
Would be that characteristic properties of particular nervous systems 
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would manifest themselves in the same way whenever the appropriate 
conditions presented themselves. Translating this argument, rather 
freely, into behavioural terms, we would expect a significant degree of 
correlation amongst different conditioning measures and further, we 
would expect all such measurements to be correlated significantly with 
a reliable measure of introversion/extraversion. 

The terms of the initial research were broadened at a later date to 
incorporate processes other than conditioning. The notion of generality, 
implicit in the argument applied to conditioning, was extended to other 
test situations in which the inhibition/excitation balance seemed to 
play an important role (see chapter by Holland). The expectation then 
became that all relevant measures would correlate together and with 
extraversion. In view of the expansion of the original study, this wider 
aspect of generality, which has become associated with this investiga- 
tion, has been given a separate treatment by Eysenck in Chapter 5. In 
that chapter the results of a principal component analysis, carried out 
on the whole matrix of intercorrelations of all tests administered in the 
work being reported here and in that reported by Holland, will be 
presented, together with some matters of the widest implications. 

We shall here be concerned with the data from the ‘conditioning’ 
procedures without making specific reference to the problem of general- 
ity. We shall centre our interests on a number of crucial correlation 
co-efficients which have a direct bearing on expectations derived from 
the basic hypothesis about extraversion and the inhibition/excitation 
balance and we will not anticipate the ‘second-order’ type of analysis 
presented in the later section. 

Because some rather new techniques were to be employed and, be- 
cause some were new to us, a pilot study preceded the main investiga- 
tion. Although we do not intend to give a full account of this we shall 
be referring to it from time to time and shall therefore include in our 
description of the experimental details any additional information neces- 
sary to outline this pilot study. In both the pilot and the main investiga- 
tions, on the basis of the hypothesis connecting the inhibition/excitation 
balance with introversion/extraversion, it should be remembered, we 
expected that appropriate ‘conditioning’ measures would themselves 
intercorrelate and would correlate with a measure of this personality 
dimension. In point of fact our primary concern, in the pilot study, was 
to try out the unfamiliar technique and to examine only in a preliminary 
way, their relationship with introversion/extraversion. 


SUBJECTS 


Twenty male students from the University of London’s Institute of 

Education formed the subjects of the pilot study. Of these, three had 

to be rejected through apparatus failure. The mean age of the remaining 
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seventeen was 29-17 + 6:28 years. These subjects answered an appeal 
for volunteer subjects made to Institute classes and received their 
expenses and a fee for their attendance. 

The subjects of the main investigation were 100 students from the 
apprentice training school of a large industrial concern. Students at the 
school are selected by standardized test procedures and on the basis of 
their general school record. They are subdivided into two groups: a 
craft’ group, whose members undergo a training in a skilled trade, and 
a ‘student’ group whose members study to qualify in some branch of 
mechanical or electrical technology. Both sub-groups are represented 
in our subject sample. Their ages ranged between sixteen and nineteen 
years, On the National Institute of Industrial Psychology’s Test 90A 
they scored a mean of 822 + 13-71 which, comparing these scores to 
those of an unselected group of secondary school children, places them 
towards the top of the middle 40% of pupils. In comparison to a Gram- 
mar school population they are placed somewhat below the middle of 
the same 40%. In both investigations, subjects with obviously relevant 
physical defects—e.g. any degree of deafness, colour blindness, etc.— 
were excluded, as well as those with any history suggestive of brain 
damage or any undergoing medication. Very few cases from amongst 
the trainee group had to be excluded on these grounds since their 
school selection had already screened out boys with major disabilities. 
However, a number were lost through apparatus failure. Finally, eighty 
subjects who were tested on all the procedures were obtained. 


GENERAL PROCEDURE 


Because the investigation being reported here was part of a larger 
whole, certain aspects of the overall procedure must be mentioned. 
The total testing time occupied six hours of one day, excluding appro- 
Priate resting periods. Two schedules of testing were devised which 
dovetailed so that two subjects were seen on each day. During the rest 
periods the subjects either read magazines or filled in the personality 


questionnaires. These latter comprised: 


(i) the Maudsley Personality Inventory (MPI), yielding a score on 
, Extraversion (E) and a score on Neuroticism Ne 
(ii) two forms of Cattel’s contact Personality Inventory which seems 
___ to be measuring Extraversion (©) and a lie scale (D). . 
(iii) MMPI scales (Hathaway and McKinley, 1951), Hy, Hysteria; 
Pt, Psychaesthenia; Pd, Psychopathic deviance and K, the faking 
good, faking bad factor; these together seem to cover the areas 


* Eysenck, H. J., Manual of the Maudsley Personality Inventory, Univ. of London 


Press, 1959, 
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of extraversion and neuroticism (Franks, Souief, and Maxwell 
1959). 


The general intelligence test scores reported (test 90A of the National 
Institute of Industrial Psychology) were obtained from the apprentice 
training school records of the subjects. 

Four techniques were used in the investigation being reported here. 
They were (a) Eyeblink Conditioning, (6) Salivary Conditioning, (c) 
Verbal ‘Conditioning’ and (d) Spatial ‘Conditioning’. Half the subjects 
were put through the procedures in the morning and half in the after- 
noon. Each subject was seen by this investigator for about three and 
a half hours. The same order of administration was followed through- 
out the whole investigation. We shall deal with each procedure 
separately. 


EYEBLINK CONDITIONING 


Eyeblink Conditioning was included in this investigation on the basis 
of the previous work done by Franks (1956, 1957b) and by Das (1957). 
In his original study, Franks (1956) was able to discriminate a group of 
dysthymic patients (tense, anxious introverted neurotics) from a normal 
group and a group comprising hysterics and psychopaths. The hospital- 
ized subjects were classified into groups according to twin criteria, a 
questionnaire measure and a rating by the relevant consultant psychia- 
trist on the basis of a check list of behavioural characteristics. When 
this rather careful selection procedure was carried out, it was felt that a 
fair degree of reliability in classification could be obtained and the 
hypothesis mentioned in the introduction could be tested. It was pre- 
dicted that, if inhibitory tendencies predominate in the extraverts (the 
hysteric-psychopath group) then this group would condition less readily 
than the introverts (the tense, anxious dysthymics), in whom excitatory 
tendencies are held to predominate. This prediction was verified at an 
acceptable level of statistical significance and from this point Franks 
proceeded to test similar predictions in a variety of situations, including 
a confirmation of his finding with a normal group (Franks and Laverty, 
1955, Franks, 1957, 1957a, Franks and Trouton, 1958). In general Franks’ 
later studies have borne out his earlier findings. 

Das (1957) was primarily concerned with the relationship between 
suggestibility, susceptibility to hypnosis and eyeblink conditioning. He 
also administered the Maudsley Personality Inventory, as did Franks 
in his study of normal subjects, and hence, the work of Das provides 
valuable comparison data with Franks’ study. In fact, Das did not dupli- 
cate Franks’ result and could report only a correlation of —0:083 be- 
tween extraversion and conditioning whereas Franks had found a co- 
efficient of —0-46 (Franks, 1957). This contradiction in results will be 
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commented upon more fully in the later section in which the results 
of this investigation are discussed. 

One other very important body of relevant work should be mentioned, 
that of Spence and Taylor and their associates (Spence and Taylor, 
1951, 1952; Spence and Farber, 1953; Farber and Spence, 1953). Much 
of this work has been concerned with the relationship of ‘Drive’ to 
eyeblink conditioning where ‘Drive’ is present in the form of manifest 
anxiety, as measured by the Taylor Manifest Anxiety Scale (MAS). The 
researches of these workers derived from the theories and writings of 
Hull (1940, 1943, 1952) and of Pavlov (1927) very much as do those of 
Franks and that being now reported. The main difference between the 
two sets of studies is one of aim. Spence, Taylor and their associates 
are primarily and directly concerned with the generalized intervening 
variables of learning and motivation. Franks and the present writer, 
have been primarily concerned in following up the initial suggestions 
of Eysenck relating these variables to individual personality types. 

If one accepts a Hullian account of learning and motivation as a 
working model then both drive, in the form of manifest anxiety (or as 
neuroticism, to use Eysenck’s term) and inhibitory tendencies (associ- 
ated with extraversion) should affect such a procedure as eyeblink 
condition, or any other learning or conditioning process. Spence and 
Taylor have claimed to have demonstrated the relationship of drive to 
eyeblink conditioning, but they, and those who have followed them, 
have largely ignored the at least equal importance of the inhibitory 
tendencies in such processes. ; 

The specific aim of this part of the investigation then was to examine 
the relationship of the inhibitory tendencies and of drive, in the form 
of neuroticism, to eyeblink conditioning in a normal group of subjects. 
Eysenck and Franks have stressed the importance of the inhibitory 
tendencies. Our main emphasis will be on this line of reasoning and, 
following it, we would predict a negative correlation between a measure 
of extraversion and a measure of eyeblink conditioning. From the 
Spence-Hull position, one which, it should be stressed, is in no way In 
Opposition to that of Eysenck, a positive correlation between eyeblink 
Conditioning and a measure of neuroticism would also be predicted. 


Apparatus N 
A detailed description of the procedure and apparatus for the eyeblin 

conditioning tecinigue followed here has been given by Franks (1956) 
and by Franks and Withers (1955). Briefly, to summarize this informa- 
tion, the conditioning was carried out ina sound-proofed room, divided 
across its longer dimension by a curtain. In one half of the room the 
Subject was seated in a chair provided with head and arm rests; 1n the 
other half, the experimenter sat before the apparatus for delivering the 
Stimuli and recording the responses. The subject's field of vision was 
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restricted by a three-sided booth, painted a neutral colour and having a 
small red light in the middle of the panel facing the subject. 

The conditioned stimulus was a tone of 1000 c.p.s. delivered at 60 db. 
above each subject’s bi-aural threshold, through a pair of high quality 
earphones, heavily padded to reduce ambient noise inside the sound- 
proofed room. The unconditioned stimulus was a puff of air delivered 
through a plastic tube at a pressure of 65 mra. of mercury. The two 
stimuli overlapped in time—the tone lasting 850 msec., the puff being 
delivered 450 msec. after the commencement of the tone. Both stimuli 
terminated together. Each stimulus could be delivered alone, or both 
be delivered together in the pre-arranged timing sequence. 

The eyeblink response is recorded by means of a photoelectric cell 
held in the right hand lens carrier of a pair of spectacle frames which 
the subject wears. This carrier also holds the open end of the plastic 
tube through which the puff is administered. As an eyeblink causes the 
amount of light reflected from the eye to alter, a change in the electrical 
discharge from the cell is produced which, after suitable amplification, 
moves the ink writing pen of a recording milliameter. 


Procedure 

The subjects were told that certain automatic responses were to be 
measured. They were seated in the chair in front of the booth and the 
earphones and spectacle frames placed in position. At this time the 
subject’s attention was drawn to the red light and they were instructed 
to watch it whenever it came on. This procedure aids in keeping the 
subject’s head still during the crucial period within which the stimuli 
are administered. Any conditioned response to this light occurs before 
the onset of the tone conditional stimulus as the light is switched on four 
to eight seconds before the administration of the two crucial stimuli. 

The subject’s bi-aural threshold was then ascertained and the ampli- 

tude of the tone set to 60 db. above this threshold. Four conditioned 
stimuli alone were then administered to test for blinks to the tone as 
such. The subjects were then told that they would feel little puffs of 
air in their right eye. They were told to respond naturally to the puffs 
and to neither inhibit blinks nor to close their eyes to avoid them. The 
importance of watching the red light was stressed. Three puffs alone 
were then administered to accustom the subjects to this stimulus and 
to ensure that a blink could be elicited. Three more tones were adminis- 
tered to acquaint further the subject with the stimuli, he was asked if he 
were comfortable and if he understood what he had to do and the con- 
ditioning schedule was then commenced. 

Eighteen conditional stimuli, dispersed amongst thirty reinforcement 
pairs in a partial reinforcement schedule comprised the first part of the 
conditioning procedure. The number of times a subject responded to 
the conditional stimulus when presented alone, constituted the prin- 
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cipal score on the eyeblink conditioning procedure. As a secondary 
interest was whether anything analogous to reminiscence in learning 
occurred with conditioning, a second session was completed one hour 
after the first. Each session occupied half an hour. 


Results 

Table I summarizes the principal findings on eyeblink conditioning. 
It should be noted that Trials *1-18' refers to the first experimental ses- 
sion and Trials ‘19-37’ to the second. The four questionnaire measures 


TABLE 3.1 


Intercorrelations of the number of conditional eyeblink responses with 
various personality measures 


Trials Trials 
‘1-18’ *19-37 Hy Pd B 
"m trials *1-18" --0-661** — —0:016 --0:038 — 0:080 
33. trials ‘19-37’ --0:032 4-0:018 —0-060 
Cc Pt K D N 


40126 +0066 +0183 


+ 32, trials ‘1-18’ +0028  +0-018 
0:082 0:000 


33. trials 19-37 — 0:017 —O-141 40:111 


icant at the 1% 


Henceforth ** will indicate a coefficient signifi i 
» » 5% 


* 
where there are 78 degrees of freedom 


t Reference numbers of variables in principal component analysis. 


thought to be related to extraversion are listed first (Hy, Pd, E and C) 
and the four judged to be related to neuroticism are listed next (Pt, K, 
D, N). Of the correlation ratios in Table 3.1 only that between the two 
conditioning sessions is significant although that between N and trials 


TABLE 3.2 
er of conditioned responses in trials 1-18 with 


Comparison of mean numbi 
that in trials 19-37 


Standard 
Mean Deviation 
32. Trials 1-18 6:397 4:367 
33. Trials 19-37 8-628 4:539 
n = 80 
t = 544** 
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1-18 approaches very close to significance (r = 0-187 is significant at 
the 5% level). 

The analysis of the ‘reminiscence’ effect as such is to be presented 
separately by Franks and Willett (1959) but table 3.2 compares the mean 
number of conditioned responses in the first session with that in the 
second. Table 3.II indicates that a strong practice effect has operated 
from the first set of trials to the second. m 


Discussion 

The results of this study are equally discouraging to an hypothesis 
linking extraversion via inhibitory tendencies to eyeblink conditioning 
or to an hypothesis linking anxiety-drive to eyeblink conditioning. The 
table below however shows that, with one exception, the present failure 
to find significant correlations between the measure of conditioning and 
personality measures is consistent with other work using the same pro- 
cedure and apparatus, when normal subjects have been employed. 


TABLE 3.3 


Pearson product moment correlations between eyeblink conditioning and 
personality measures from five comparable studies using normal subjects 


*E' of *N' of R 
MPI MPI STDCR MAS MMQ 
Franks (1954) N = 20 
(male, female hospit. staff) —0-048 | —0:008 | —0-145 
Franks (1957) N — 55 
(male students) —0-46t 0:04 
Das (1957) N = 68 
(male students) —0:083 | —0:027 
O'Connor$ (1959) N — 14 
(males) —0:38 
Willett (Present study) N =80 
(male apprentice trainees) | —0-08 0-183 


1 Significant at 1% level of confidence. 
8 N.B. O'Connor used the short version of the MPI (Eysenck, 1958). 


The correlation coefficients, expressing the relationship between eye- 
blink conditioning and extraversion (R and E), shown in Table 3.3 
may be regarded either as derived from five separate studies or as esti- 
mates of some true coefficient, obtained from five successive samples. 
Taken as derived from five separate studies, each coefficient is tested 
against zero. On this view, it can be seen that the result obtained in this 
study is consistent with three of the other four studies. Considered as 
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derived from five successive samples varying about some true value, 
the five coefficients would seem to be varying about a relatively small, 
negative value. That this value is unlikely to be zero is indicated by the 
fact that all five coefficients are negative. The negative sign is, of course, 
in accord with the predicted relationship between eyeblink conditioning 
and extraversion. Furthermore it would seem reasonable to suppose 
that Franks’ finding of-a coefficient of —0-46 (1957) differs markedly 
from the others, as no other coefficient could be distinguished from zero 
while it differs significantly from zero. 

To obtain an estimate of the true coefficient a z transformation of the 
coefficients was made (Fisher, 1948) and an average weighted z score 
computed. This z score was transformed back into product moment 
coefficient form and tested against zero. The estimate of the true co- 
efficient obtained by this procedure was —0-188 which, for 227 degrees 
of freedom (Zn — 2)) is significant at the 5% level. 

That Frank's’ result (1957) seems the exceptional case has been com- 
mented upon. If we are dealing with successive comparable samples it is 
difficult to see why this should be. The same conditioning procedure 
was used in all studies and comparable questionnaires were administered ; 
Franks (1954) used the ‘R? scale of the STDCR, from which the E 
scale was derived and O'Connor used a shortened version of the MPI. 
In the other three studies the full version of the MPI was employed. 
Where any differences in questionnaires do exist, in short, no differences 
are evident between coefficients; Franks (1957) used the full version of 
the MPI. Some differences between the subjects of the studies are 
apparent and this could be important. Here again, however, although 
Franks' (1957) subjects were different from those of his earlier study 
(1954), from those of O'Connor (1959) and from those of this study, 
they came from the same population as did those of Das (1957) whose 
results are comparable with all others except Franks’ (1957). Franks 
(1954) himself considered the importance of age, sex and intelligence to 
conditioning and concluded that these factors had relatively little bear- 
ing on this process except at extremes of age and intelligence. If some 
important variable was controlled in the selection of Franks (1957) 
subjects and not in the selection of the other subjects, it 1s not obvious 
from his report what this might have been and what its influence on the 
other studies could have been. We cannot ignore the possibility that 
environment factors, whose influence on conditioning is as yet unknown, 
may have an important bearing on the formation of conditional 
responses. In the absence of any such knowledge, however, Franks 
finding appears as a sampling deviation and we must conclude that a 
small negative correlation represents both the most frequent finding 
and the best estimate of the true correlation between extraversion and 
conditioning. ] z 

One then feels constrained to ask whether this represents a fair test 
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of the prediction derived from the basic hypothesis relating eyeblink 
conditioning to extraversion and, immediately, the possibility that the 
questionnaire measure being employed is a poor criterion of this para- 
meter demands scrutiny. To state this possibility succinctly; we are con- 
cerned with showing that conditioning is related to extraversion in a 
particular way because both are related to inhibitory tendencies. Where 
normal subjects have been employed it is conceivable that the ques- 
tionnaires used do not adequately differentiate between the groups in 
the manner required to test the hypothesis. 

No direct evidence can be brought to bear in support of this sugges- 
tion, however, the following should be borne in mind. When he obtained 
significant coefficients with his abnormal groups, Franks (1954) used 
a double criterion, as has been pointed out. The questionnaires were 
administered, as in the later studies, but the obtained scores had to 
agree with ratings on clearly defined behavioural characteristics made 
by the psychiatric staff before a subject was included in one of the 
tested groups. Thus up to a point, behaviour ratings were guiding 
the selection and it would happen then, even if the questionnaire was 
functioning inadequately, something approaching a satisfactory differen- 
tiation could have been achieved by virtue of these ratings. An on- 
going investigation by Claridge throws further light on this problem. 
He administered a group of tests to psychiatric subjects where, as in 
eyeblink conditioning, certain predictions were made connecting per- 
formance to personality measures. He found that, in many cases, no 
relationship could be found when questionnaire measures were used but 
that employing the same kind of behaviour ratings as Franks employed, 
the relationships, as predicted, did appear. Sigal, Star and Franks 
(1958), investigating the ability of the MPI to discriminate between a 
group of dysthymics and a group of hysterics selected on case history 
symptom ratings by three raters, showed that no differentiation on the 
basis of the questionnaire scores could be made between these two 
groups. 

Furthermore, if the questionnaire measures are not involved, predic- 
tions based on the central hypothesis fare reasonably well. Das, for 
example, found the predicted correlation between hypnosis and condi- 
tioning. There is a reasonable level of consistency in the drug studies 
reported here and elsewhere (vide Volume One) and a certain degree of 
order can be brought to the behavioural effects of psychosurgery (Willett, 
1959) to mention only some of the instances where the concept has 
proved useful. 

However, there is no gainsaying the fact, dramatically illustrated in 
Table III, that when predictions involve questionnaire measures they 
rather too frequently fail to be verified. Note also for example, the al- 
most total absence of any significant correlations in the section on drug 
effects despite the appearance of many predicted effects. One is thus 
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inclined to conclude that when the Maudsley Personality Inventory 
is used (to take our principal questionnaire) to give criterion scores, 
it gives very poor ones. So poor, that no information is added to 
that which might be obtained from investigating the processes them- 
selves. 

This writer feels that progress in describing the connection between 
personality parameters aad fundamental explanatory processes is being 
retarded by the lack of a more valid personality measure. The negative 
results of this investigation exemplifies this lack. 


SALIVARY CONDITIONING 


A procedure hallowed by its associations with the animal investigations 
of the early Russian physiologists, Salivary Conditioning has been fre- 
quently studied in human subjects. (Gantt, 1936, 1938; Razran, 1933, 
1935, 1936, 1939, 1955a, 1955b; Finesinger and Sutherland, 1942; Bindra, 
Paterson and Strezelecki, 1955.) Since this procedure seemed to have 
been employed successfully by these workers and because of its con- 
nections with classical conditioning experiments it was decided to include 
salivary conditioning in this investigation. As before, with eyeblink 
conditioning we were concerned here with the relationship of salivary 
conditioning to the personality dimensions of introversion/extraver- 
sion and neuroticism as measured by the various questionnaires already 
described. It was expected that a negative correlation between intro- 
version/extraversion and salivary conditioning would be found. The 
salivary conditioning procedure was tried out in the pilot study and 
satisfactory correlations were found between a score on this function 
and E and N of the MPI. (r(E and Saliv. Condit.) — —0:57, r(N and 
saliv. condit.) = 0-54), both being significant at the 5% level of 
confidence). j j 

This finding was sufficiently encouraging to warrant proceeding with 
the major study to be described. The procedure in both the pilot study 
and the main study were the same. 


Procedure 
An adaptation of the procedure described by Razran (1936) was fol- 
lowed, particular attention being paid to his observation that subjec- 
tive attitudes affect the formation of salivary conditioned responses in 
human subjects. The subjects were therefore required to Seek a 
ball-rolling maze game which had been previously shown to hold the 
interest of males, over a period of at least two hours. -— 
The task consisted of a maze path described between sets of num. z 
holes on a tray, which could be tilted, either up or down or from left n 
tight, by manipulating two knobs. The subject was instructed na 
the ball along the track as far as he could by tilting the tray, withou 
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allowing it to fall into one of the holes. He was told that his score was 
the furthest point he reached, that it was an advantage to proceed care- 
fully and slowly, that he was not being timed and that if, at the end of 
a trial the ball was still on the tray, he was to leave it where it was and 
commence from that point when the next trial began. He was allowed 
a short time to practise manipulating the knobs and the experimenter 
demonstrated that it was possible to roll the bal*a considerable distance 
along the track without committing an error. 


Technique of Conditioning 


Fitted to the maze game and elevated three inches above it were a red 
lamp and a white lamp each of 6 volts. These could be switched on or 
off independently by the experimenter and provision was made for the 
white lamp to flash on and off once a second. 

Razran's technique (1935) of measuring increments in salivation by 
the insertion of cotton rolls under the tongue was followed (Johnson 
and Johnson, No. 1), with the modification that, on receiving the rele- 
vant signal, the subject placed the roll under his tongue himself with a 
pair of sterilized surgical tweezers and removed it in the same way 
instead of the experimenter carrying out these operations. 

The experimental session occupied about an hour and forty minutes 
and consisted of a number of ‘periods’ as follows. 


(1) The reinforcement or training periods consisted of the subject 
practising on the ball maze with the red light switched on and the 
white light flashing (CS), while sucking 25 mgm. tablets of 1-ascor- 
bic acid (USC). There were two reinforcement periods of ten 
trials, each trial lasting for two minutes and separated by rest 
pauses of thirty seconds. 

(2) Periods of ‘regular control’ consisted of the subject practising on 
the ball maze with both lights switched off. These consisted of 
45-second trials and one trial followed each of the reinforcement 
periods to ensure that the saliva secreted during the latter was 
absorbed. 

(3) ‘Test’ periods consisted of the subject practising on the ball 
maze with the red light on and the white light flashing with the 
cotton rolls in position. These consisted of 45-second trials, three 
trials following the first reinforcement period and five following 
the second. 

(4) ‘Control’ periods consisted of the subject practising on the ball 
maze with both lights switched off. These consisted of 45-second 
trials and one was paired with each ‘test’ trial, so that three fol- 
lowed the first of the reinforcement periods, and five followed 
the second. 


‘Test’ trials and ‘control’ trials thus came in pairs and the order of 
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presenting ‘test’ trials and ‘control’ trials was alternated throughout 
the eight pairs and from subject to subject. 

After the subject received his instructions concerning the ball maze, 
he was given the following instructions relating to the cotton rolls 
and the signals he was to act upon. 

‘While you are practising on this game I am going to take a number 
of measurements from you that will require you to place these rolls of 
cotton underneath your tongue. You will use these tweezers to place 
the rolls under your tongue and also to remove them. You will place 
and remove the rolls, and commence and stop practising the game to 
certain signals. The signals are as follows. 


A. (Relevant to the ‘test’ trials.) ‘When the red light comes on you 
are to place the rolls under your tongue. When the white light 
begins to flash, commence to practise on the game immediately. 
When the lights go out you are to stop practising on the game 
immediately and remove the cotton roll from under your tongue 
and hand the roll to me, in the tweezers. 

B. (Given prior to each of the reinforcement periods.) *This time 
when the red light comes on I want you to take one of these 
tablets, place it in your mouth and commence practising on the 
maze when the white light begins to flash. I don't want you to 
chew the tablet or swallow it—it won't hurt you if you do because 
it is only a flavouring matter; it tastes like a sweet—but I want 
you to suck it all the time you are practising on the maze. When 
the lights go out, stop playing the game immediately and if there 
is anything left of the tablet, remove it and place it in this basket. 

C. (Relevant to the control periods.) ‘When I say “now” you are to 
place a roll of cotton wool under your tongue and commence to 
practise on the game immediately. When I say “stop you stop 
playing the game and hand me the roll in the tweezers. 


Summary of the procedure 

(a) Instructions to S concerning game and cotton rolls. "me 

(b) Three ‘introductory’ trials consisting of three pairs of ‘tests and 
‘control’ procedures, each of 45-second duration. ee 

(c) Ten reinforcement trials with the red light on, the white light 
flashing and S sucking tablets of 25 mgm. l-ascorbic acid; 
each trial of 2 minutes duration, with 30 seconds rest between 
trials. ; 

(d) One ‘regular’ control trial of 45 seconds duration. 

(e) Three pairs of ‘test’ trials and ‘control’ trials each of 45 seconds, 
the order of ‘test’ and ‘control’ being varied throughout the three 


pairs and from S to S. 


(f) Ten more reinforcements as in (c). 
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(g) One ‘regular’ control as in (d). 
(h) Five more pairs of ‘test’ and ‘control’ trials as in (d). 


In periods ‘b’, ‘d’, ‘e’, ‘g’ and ‘h’, the rolls were weighed immediately 
on removal from S’s mouth by means of a torsion balance. They were 
also weighed before being placed in position, to allow for slight varia- 
tions in the size of the rolls. 


Results 
Table 3.4 shows the correlation coefficients between the two related 
measures of salivary conditioning and the questionnaire measures. In 


this table sf 29 


n 
each paired ‘control trial. Other things being equal, the larger this 
difference the greater the degree of conditioning achieved. Because such 
very large variations occurred within and between subjects in the amount 
of salivation, a second score was taken from the simple count of the 
number of times the salivation on the ‘test’ trials exceeded the saliva- 
tion in the paired ‘control’ trials. This score we have called ‘No. of -+’. 


is the mean difference between the ‘test’ trials and 


TABLE 3.4 


Pearson product moment correlations between two measures of salivary 
conditioning and various personality measures 


x co | No. of + HY Pd Nd 
"m 
34. X c9 T0712** | —0-114 10:173 +0:095 
35. No. of + —0:116 +0:018 +0:077 
c' of CPF Pt K D N 
34, X co +0-078 +0:122 | —0-319** | —0-134 --0-079 
35. No. of + —0:067 +0:193* | —0-349** | —0:222* | +0:142 


Apart from the understandably high intercorrelation between the 
two conditioning measures, four reasonably low, but significant corre- 
lations with measures of neuroticism are evident in Table 3.4. 

The individual scores on this conditioning measure were characteris- 
tically extremely variable, particularly from one subject to another but 
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also even within one subject’s record. So much was this the case that 
one could not tell just by looking through the array of scores whether 
on the whole salivation was greater during the ‘test’ periods than during 
the paired ‘control’ periods. This feature of the scores is well illustrated 


by stating that the mean of all the E (T — O scores was only + 14:79 
n 


while the standard deviation of these scores was 65-38. It should be 
remembered that 2: coo is the mean of the differences between the 
eight pairs of ‘test’ and ‘control’ scores for any individual and can be 
either positive or negative. 
T-—C) 
n 


To ascertain whether the mean of the È ( did differ signifi- 


cantly from zero (note that the mean is positive indicating a higher 
degree of salivation under the ‘test’ condition), a simple ‘f’ test was 
carried out using the values for the mean and standard deviation already 
given. For the eighty cases of our study this computation yielded a ‘t 
value of 2:00 which, for this number of cases indicated that the mean of 


the & g-9 scores did differ significantly from zero (P = 0023), a 


requirement to be fulfilled if conditioning is being claimed to have 
occurred, 

_ It still seemed possible that although the ‘test’ trials were producing a 
Significantly greater amount of salivation this was due, not to any con- 
ditioning process but to some simple effect of the blinking lights used 
as the conditioned stimulus. To test this possibility a control study was 
carried out on a sample of twenty cases drawn from the same parent 
population as that of the main study. These subjects were put through 
exactly the same procedure as in the main study with the exception that 
during the twenty two-minute reinforcing periods no 1-ascorbic acid 
was administered. In other words no reinforcing situation was present 
and the comparison that could now be made between the ‘test’ and the 
‘control’ periods was simply the effect of the blinking lights present in 
precisely the same context as in the main study. This time the mean 
difference between the ‘test’ and ‘control’ measures of salivation, com- 
puted exactly as before, was 429-4 and the standard deviation of the 
array of mean differences was 74°83. Again thesignificance of this value's 
departure from zero was computed and the ‘£’ value this time equalled 
1-7 which, for the twenty cases used here, indicates that the departure 
from zero of this mean cannot be viewed with as much confidence as 
that of the main study (P = 0:053). 

The scores of both the main study and the control study were 
examined to discover whether any increase in amount of salivation 
throughout the period to testing occurred but in both cases the sum of 
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the first two measurements taken over all the subjects was almost pre- 
cisely the same as that of the last two. 

Finally the correlation between amount of salivation and E and N 
was computed. The salivation score used here was the sum of the six 
45-second trials (three ‘test’ trials and three ‘control’ trials), given at the 
commencement of the procedure to accustom the subject to the tech- 
nique of inserting and removing the cotton rolis. A correlation coeffi- 
cient of —0-13 was obtained between amount of salivation and extra- 
version (E) and of 0:13 between amount of salivation and neuroticism 
(N). Both personality measures were taken from the Maudsley Per- 
sonality Inventory and neither value was statistically significant. 


Discussion 

The degree of variability present in the salivary responses in this 
investigation seems a most unsatisfactory feature of this technique to 
this writer. As has been pointed out, the variability was not only between 
subjects but within the responses given by any one subject. The whole 
range of the torsion balance was used, some subjects giving readings of 
less than 100 milligrams and others ten times as much. Individual varia- 
tion was not quite as great but ratios of three or four to one from one 
response to the next were not uncommon. It was the noting of this 
variability in the pilot study that led to the practise of alternating the 
order of ‘test’ and ‘control’ trials within and between subjects and, in 
consequence of it, one cannot feel very confident that conditioning was 
achieved at all. The control study did clarify this situation somewhat 
and suggested that the conditioning apparently achieved was an artifact 
of the stimuli used. Thus the use of the salivary conditioning measures 
to further our knowledge of the relationship between personality para- 
meters and conditioning must be viewed in the light of the uncertainty 
concerning the measures employed. 

It was recognized early in the investigation that, to a far greater 
degree than is the case in eyeblink conditioning, the stimuli of salivary 
conditioning are very complex. The conditioned stimulus in our proce- 
dure is clearly not only the blinking lights but the whole complex of 
placing cotton rolls in position, playing a ball game, being timed, seeing 
cotton rolls being weighed and so on. Almost all of this complex is 
carried over into the control situation, the only difference being the 
change of signals and the absence of the lights. Certainly, as the experi- 
ment was carried out in a room only dimly lit, the absence of the lights 
was very conspicuous, nevertheless, the degree of similarity between the 
‘test’ and ‘control’ situations was considerable. It was for this reason 
that the scores were examined for any overall increase in amount of 
salivation throughout the course of the testing session. A general rise 
in salivation could have been a reflection of a conditioned response of 
salivation developing to the total stimulus complex. The control study 
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proved unnecessary from this point of view since no overall increase 
was evident in it or in the main study. 

We examined the relationship of amount of salivation to the per- 
sonality parameters as an interesting side aspect of the investigation 
because, salivation being closely related to antonomic activity, it was 
thought that a relationship might exist between this measure and neuro- 
ticism. Wenger (1948) included salivation as one of the indicators of 
antonomic activity in his study of antonomic balance in army air-force 
personnel. In a series of comparisons between aviation students, psy- 
chiatric groups and ‘Operation Fatigue’ groups it would appear, from 
Wenger’s study, that with some inconsistencies, salivation discriminates 
between groups where the degree of emotional stress varies. In general, 
the direction of the difference is opposite to that one might have 
expected. In all but one of the comparisons where significant differences 
were found, the group characterized by being rather more ill showed less 
salivation. In a centroid factor analysis of his data Wenger identifies 
salivation as an indicator of antonomic activity although, in fact, its 
loading on the factor he identified with antonomic activity was only 

:25, 

Wenger also correlated his physiological measures with the scales of 
the STDCR. In agreement with the results reported in this study he 
found no significant correlation with D and C (elsewhere shown to be 
clearly related to neuroticism, Franks, 1954) but reports a significant 
but very low correlation of +0:18 with R from which the E scale of the 
MPI was derived. P 

Coming to the principal result then we shall have to bear in mind the 
points already made. Namely, that whether conditioning occurred at 
all, seems problematical and salivation, as such, does not seem to be 
very closely related to either neuroticism or extraversion, as measured 
m Maudsley Personality Inventory or the D, C and R scales of the 

DCR. 

We note first of all that, as in eyeblink conditioning neither E nor N 
of the Maudsley Personality Inventory shows any sizeable relationship 
with the measures of salivary conditioning. The four significant coeffi- 
cients found (apart from that between the two conditioning ieas 
are with other measures only associated with neuroticism. Those wit 
K, the ‘Faking good, Faking bad’, scale of the MMPI, are significant 
at beyond the 1% level. 

D here is the lie scale of the Contact / Venio 
and gives the next highest correlation with the salivary TEN 
measure. If in fact these significant correlations have not simply xa 
thrown up by chance from amongst the very many computed, docs this 
result mean that these scales are better measures of personality pem 
sions relevant to conditioning than the other scales? It yon Tees 
Dot, for neither show any degree of concomitance with eyebir 
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conditioning, a more reliable conditioning procedure than this of 
salivary conditioning. 

The correlations, in the case of both K and D, are negative such that 
high scores on these questionnaire measures are associated with low 
Scores on salivary conditioning. In this study both K and D have 
negligible correlations with extraversion (E of MPI) but both have 
highly significant negative correlations with N-(of MPI) such that high 
scores on K and D are associated with low scores on N. It should also 
be noted that K and D correlate with each other to the extent of 0-5. 

If anything is to be made of these facts it must be that persons with 
low scores on neuroticism also get low scores on salivary conditioning. 
In other words since K and D are in a sense the inverse of N and K and 
D correlate negatively with salivary conditioning, a positive relation- 
Ship between salivary conditioning and N might be said to have been 
demonstrated. However, neither K nor D functions in the same way in 
relation to the other measures of conditioning, as has been pointed out. 

Bearing in mind the connection between N and D (of STDCR) and 
Wenger's (1948) results, the correlation between K and amount of sali- 
vation was computed in the same way as was done with E and N. 
Again this measure of salivation bore no relationship to the personality 
measure (r = 0:08). N 

In general one must conclude, as in the case of eyeblink conditioning, 
that no meaningful or consistent relationships between this condition- 
ing measure and the personality parameters has been demonstrated. 
The criticisms one made in the section on eyeblink conditioning about 
the inadequacy of the personality measures applies in this case also of 
course, but with this measure the situation is worsened by the uncer- 
tainty of whether conditioning occurred at all. No firm conclusion 
either negative or positive can be drawn from the results of this study 


therefore. 


VERBAL CONDITIONING 


The commencement of interest in the phenomenon of Verbal Con- 
ditioning, or Probability Learning, as it has more appropriately come 
to be named, is commonly associated with a study of Humphrey’s 
(1939). In this study the subject was required to say whether, when a 
light came on, a second light would follow it or not. In Humphrey’s 
study the second light followed 50% of the time. Grant, Hake and 
Hornseth (1951) employed ratios of 25%, 50% and 75% in repeating 
Humphrey’s study. In both cases it was found that subjects would match 
the frequency of their ‘yes’ guesses about the second light to the actual 
probability of its appearance, although the sequence of appearance and 
non-appearance was, to all intents and purposes, random. This pheno- 
menon of frequency matching we have investigated in its relationship 
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to individual personality measurements by examining the rate and regu- 
larity with which subjects changed from one response frequency to 
another. 

We have taken the simplest case of probability learning where cues 
are minimal. The determining cognitive conditions and general rami- 
fications of this and of related processes are many. An excellent account 
of these will be found im Bruner, Goodenow and Austin (1956). 

The general design of the experiment required the subject to accept 
first a particular frequency of appearance for one of a pair of stimuli 
in an initial series of trials. The ratio chosen was that used by Humphrey, 
0-50. In other words on half the choice occasions one choice was ‘cor- 
rect’ and in half, the other. It was hoped thus to establish a frequency 
matching tendency in this period. Then, without the subject’s know- 
ledge, the ratio was changed so that, approximately one choice was 
‘correct’ twice as frequently as the other. It was hoped to plot the course 
of adapting to the new frequency ratio and to relate, particularly, the 
rate of change to the personality variables. . 

If a conditioning model is appropriate to this task (a matter to be 
discussed) then one would expect from the hypothesis about extra- 
version and conditioning that subjects with low scores on this factor 
(i.e. the introverts) would readily match the stimulus frequency in their 
responses and then only slowly change to the new frequency. That is, 
they would ‘condition’ well and extinguish slowly. The persons with 
high scores on extraversion would not match the initial frequency as 
well and would change more easily but more erratically. In the pilot 
study a positive correlation was obtained between rate of change and 
extraversion and a negative correlation between variability and extra- 
version. Neither was significant statistically but they were of sufficient 
size to warrant the work undertaken in the main study. 


Apparatus 


The apparatus consisted of two white six-volt lamps set pnm 
inches apart in the middle plane of a grey board three feet by a 
Nine inches above the mid-point of the line joining the two white lig 4 
was a six-volt red lamp. The white lamps were designated p (D d 
right (R) and the red lamp referred to as the ‘signal light’. The boar 
was held in a stand so that these three lights faced the subject. A 
Behind the board, where the experimenter sat, were a duplicate i o 
lights, which came on simultaneously with the lights seen by the su ie 
This was to ensure that, barring bulb failures, the epea aon 
be immediately aware if any of the lamps failed to work. In t E M 
base of the board the experimenter was provided with two dein va 
One controlled whether any of the three lights would come ba hae 
Other, a selector switch, allowed the experimenter, once the first ae 
closed, to select which of the two stimulus lights would come on. Car 
175 


R. A. Willett 


was taken to ensure that both switches were silent. The selector switch 
was interposed between the lights and a timer. The timer was arranged 
so that, with the switches in the appropriate position, the red light would 
come on for five seconds, then one, the other or both of the white 
stimulus lights would flash automatically. The experimenter was re- 
quired merely to select which stimulus light was to follow the signal 
light. The timing, selected on the basis of the"pilot study, was: Signal 
light on for five seconds, stimulus light flashed (== 3th second), no light 
on for ten seconds, signal light on, etc. 


Procedure 


The subject was seated before a desk in a curtained room facing the 
front of the apparatus. He was instructed in the following terms: 

“You will notice that on this board there are two white lights marked 
left and right and a red signal light. Every fifteen seconds or so the red 
light will come on and after a few more seconds one of the two white 
lights will flash like this . . . (the subject was then shown one or two 
cycles to acquaint him with the stimuli). Now I want you to tell me 
each time the red light comes on, which one of the two white lights is 
going to follow it. You must say “left” or “right” before the white light 
flashes for your answer to count and, as you have seen, you have not 
got much time.’ (The subject was then given a few trials to accustom 
him to the task of responding.) 

After any further explanations that were required the investigation 
began. 

The experimenter was provided with record sheets on which the two 
schedules were laid out with spaces opposite for recording the subject’s 
responses. 

Preliminary testing had shown that, as was suggested in the literature 
(Grant, Hake and Hornseth, 1951), seventy trials were adequate to 
establish a frequency matching of 50% and consequently a schedule of 
seventy trials, in which equal numbers of left and right choices were 
‘correct’, was arranged by reference to a table of random numbers. 
Likewise, preliminary testing had indicated that a ratio of seven to 
thirteen was sufficiently ‘difficult’ to yield a good range of matching 
rates through seven sets of twenty trials. Accordingly this ratio and this 
number of trials was adopted in the main study. Within successive 
blocks of twenty trials, then, in the second series of trials, seven left 
‘correct’ choices and thirteen right ‘correct’ choices were arranged by 
reference to a table of random numbers. " 

It was found that very few subjects failed to respond in the choice 
situation. If this happened more than four times the test was aban- 
doned. In those few cases where failures occurred, other than these, the 
subject's record was marked with the ‘correct’ choice. The number © 
occasions when this occurred was, however, negligible. 
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Results 

That frequency matching occurred in conformity with the first schedule 
of seventy trials, where the ratio of left to right correct choices was 
0-50, is shown by the mean number of left choices made by the eighty 
subjects. This mean was 34-69 + 3-80. Perfect matching would have 
been indicated by a mean of 35:00 + 0 left choices. 

_ That adaptation to the new ratio did occur to an appreciable extent 
is shown by Table 3.5. 


TABLE 3.5 


Mean number of ‘left’ choices made through seven series of twenty trials where 
the ratio of left to right correct choices became 7 to 13 after seventy trials where 
the ratio was 0:5 


Successive sets of twenty trials 
3. 4. 5. 


E 2 


Mean no. of Left 


choices 11:84 961 10:25 9:90 820 861 8-70 


Had a perfect and smooth adaptation occurred in this group the 
Mean values would have dropped from rather Jess than a mean of 
e in the first block of twenty trials to one of 7-00 in the seventh 

ock, 

To examine whether subjects with low scores on extraversion gave 
closer and more consistent frequency matches than subjects with high 
scores on extraversion, the mean number of left choices and the appro- 
priate standard deviations from the first schedule of seventy trials were 
computed for the group whose E scores were below the mean E score 


of the whole group and for the group whose E scores were above the 
mea d deviation of the number of left 
n E score. The mean and standard de ental 


Choices for the rers (the introverts) was 34 f 
for the high A er +410. These values are virtually 
indistinguishable. itioni i 

In exihiniag the main predictions associating conditioning with pr 
sonality parameters three scores were computed: a score whic n i ee! : 
the gross change in the number of left choices made by the su od h 
score which reflected the rate of change and a score which indicated the 
variability of the number of the subjects ‘left? choices over m om 
blocks of twenty trials. Since there were half the number of ec d 
first schedule of trials as in the second, the first puis T edhe 
simply by subtracting twicethe number of left choicesin t ; my 
from the total number of left choices in the second schedule 
hundred and forty trials. 
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TABLE 3.6 


Pearson product moment correlations of, gross change in left choices from the first schedule of 0-50 ratio to one of 7:13, rate of 
change and variability of number of left choices over seven sets of twenty trials with personality measures 


Change Rate ; 
in Left f Vari- E of C of K D N 
Choices | Change | ability | EY Pd MPI | CPF P 
42. Change in Left 
choices +0:219* | —0-236* | —0-182. | +0-008 | +0077 | —0:052 | +0:241* | —0-120 | —0-008 | +0:221* 
43. Rate of Change —0-025 | —0-093 | —0122 | —0-118 | +0051 | —0-109 | --0-117 | +0-180 | —0-186 


44. Variability +0-218* | --0220* | +0:083 | 0:044 | —0-025 0029 | 0078 | —0-157 
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The scores for rate of change and variability were obtained in the 
following way. The number of left choices in each of the seven blocks of 
twenty trials, for each subject, were treated as points lying about a 
straight line. This straight line of best fit was computed for each subject 
and its slope determined. This slope then represents rate of change. 
Variability was expressed as the sum of the squared deviations of the 
oe points from the corresponding points on the straight line of 

est fit. 

The direct measure of ‘conditioning’ hereis the rate of change measure, 
although some interest attaches to the measure of variability and to the 
Bross change scored. Table 3.6 shows the correlation of these measures 
with the personality parameters. 

In the above table Pt and N correlate significantly with the gross 
change in number of left choices while HY and Pd correlate significantly 


with variability. 


Discussion 

Unlike the case of salivary conditioning t 
here the procedure as such was reasonably successful. A frequency- 
matching response that was very close to perfect was established in the 
group as a whole and the extent to which an individual’s scores varied 


about the mean matched frequency was quite small. With the introduc- 
tion of the second schedule and the new ‘correct’ stimulus to ‘incorrect 
esponses did occur towards 


stimulus ratio, a gradual adaptation in ther j 
an approximation to the new ratio. The decrease in the number of ‘left 
choices made by the subjects was small but consistent. Only 15% of the 
subjects did not give responses yielding a negative, i.e. falling, slope. 
That the gross change score was also negative is a reflection of the 


same fact. 

When one turns to the relationship between these scores and the per- 
sonality measures, however, one again has little that is positive or con- 
clusive to report. If the extraversion scale of the MPI is to be judged as 
a valid measure of this personality para e was no indica- 
rw in this study that introverts returned ly matched fre- 

uency responses than extraverts. . 

Some positive correlations are present between the scores derived 
from the second schedule and the personality measurements. Neuro- 
ticism, from the MPI and psychaesthenia from the MMPI both fae 
late Positively and significantly with the change scores. Likewise the 
hysteria and psychopathic deviate scales of the MMPI correlate posi- 
tively and significantly with variability. 
ae the case of neuroticism d 2 xo = 

ation between he MPI and Pt, in i 
Thus, although aes cótielatum of this size the variance odes 
for by the concomitance is quite small, we have some justification [o 
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regarding the two scales as measuring the same thing. If we call the 
variable being measure by both scales, neuroticism, for the purposes 
of this discussion and bear in mind the size of the correlation between 
the two personality scales, we will not then do the data too much 
violence. 

Each scale correlates with the gross change score to almost the same 
extent, 0-241 in the case of Pt and 0-221 in the case of N. Because of 
the way in which the change score is derived (and because the array of 
change scores is very largely negative) the larger the change the larger 
negative quantity the change score will be. Thus if the relationship 
between change score and neuroticism were truly positive, the correla- 
tion coefficient would have appeared negative. Our coefficients mean 
then that the smaller the change the greater was the degree of neuro- 
ticism. This result makes little sense on a Hullian-Spence view of neuro- 
ticism nor in the terms in which Eysenck might reformulate their view. 
The curvilinear relationship between performance drive and task com- 
plexity could account for the result but unless one can, by varying 
experimental conditions, demonstrate the law operating inversely the 
ad hoc involving of this effect is not very convincing. An ad hoc expla- 
nation which, however, does seem to have greater face validity is that 
the two personality scales are, in part, measuring a rigidity factor and 
that the change score represents a rigidity score such that now the co- 
efficients are to be understood as indicating: the smaller the change 
(the larger tne performance rigidity) the greater the (questionnaire 
measured) rigidity. 

That the rate of change score does not reflect the same tendency 
renders this suggestion a little more tenuous than if the results had been 
consistent but, as the correlation between rate and gross change is only 
0:219, the proportion of the variance this coefficient explains is very 
small indeed and there is a great deal of room for other factors, includ- 
ing error, to play a part. The two correlations with rate are, in fact, 
insignificant, as are all the other coefficients involving the rate score. 
This rate of change, it was initially felt, would best indicate the condi- 
we procedure, if the procedure can be regarded as conditioning 
at all. 

Variability yielded two significant correlations with the HY and Pd 
scales of the MMPI. These scales correlate together significantly 
(4-0:205) but neither shows any large degree of concomitance with 
either E or N of the MPI. Variability was not regarded as a condition- 
ing measure but it was expected that a more variable performance 
might be expected of a subject who was not adopting any particular 

frequency-matching ratio. Again, however, we have no evidence that our 
expectation was borne out. Unlike the study on the basis of which the 
questionnaire scales were grouped (Franks, Souief and Maxwell, 1959), 
HY and Pd do not correlate, in this study, with extraversion, and extra- 
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version does not correlate with the performance measures. There is no 
evidence to suggest even that these two scales are better measures of 
extraversion than E since the two scales yield no other comparable 
correlations with any of the other performance measures. In fact one 
cannot avoid concluding that these ratios seem more than likely to have 
been thrown up by chance. 

What kind of learning’process does probability learning most closely 
resemble, one must now ask ? That an adaptation and modification of 
the subjects responses occurs is beyond doubt. That this is not prima- 
rily a question of intelligent problem-solving behaviour, in which insight 
plays an important role, is largely ruled out by the nature of the task 
and contra-indicated by the lack of any sizeable or significant correla- 
tions between scores on the procedure and scores on the test of general 
intelligence (NIIP test 90A: raa and 90A = — 0:184 Trate and 
90A = — 0-029, ryariabitity and 90A. = 0:153). This observation does not 
imply that the behaviour does not ‘occur under a problem-solving set’ 
(Bruner, Goodenow and Austin, 1956) but serves to emphasize that if 
probability learning is to be regarded as a problem-solving process then 
itis more analogous to trial and error learning or to some other pro- 
Bressive process than to insightful solution finding. The special proper- 
ties of insight are an awareness of and a suddenness in solution and a 
general reperception of all the elements of the problem in the new light 
of the solution. 

Now this is not the situation we have at all in the procedure we have 
used. As we have pointed out the progression towards correct solution 
1s gradual and the subject is not aware that he has reached it. The cor- 
Tectness or otherwise of his solution is only statistically implicit in his 
Tesponses, Far from seeing the sequence of choice points in some new 
light whose source was a correct solution, very often subjects were 
Convinced that they had discovered the ‘pattern’ of lefts and rights Ree 
the experimenter was following when, in fact, no pattern existed, other 
than the purely statistical one of an over-all frequency of occurrence 


for the ‘left? light. ; 

Trial and aim learning involves, essentially, the selection, Vw 
form of reinforcement, of oneresponse amongst à number. Moy ee d 
Probability learning situation a maximum Score can only be ac I 
if the Subject matches the choice perfectly. A perfect match pu 
guarantee a maximum score but it is the only response pattern tha “a 
give one and the subject has been instructed to attempt the maxim 


Sco i ; involved in this process is 
re. A detailed analysis of the parameters 1 6) and concerning 


to be found in Bruner, Goodenow and Austin (1956) MG P arat 

the simple type of probability learning employed in this eee 

they describe subjects as being guided in their choices Eu Um ren 

frequencies and also by certain other specific factors. The ese 

tioned as most important are the tendency to regard events as 
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in time as connected in other ways (i.e. of being mutually dependent) 
and the tendency to ‘operate on the basis of shorter runs of events than 
occur on a choice basis’. Now it seems to us that the subject’s pro- 
gressive approximation to a true frequency may most fruitfully be 
viewed as a series of trials, necessarily continuous through the nature of 
the problem, in which certain responses (incorrect matches) are dropped 
out until, by the touchstone of the recent run'of successes and failures, 
an approximation to the correct response (i.e. the true frequency match) 
is made, tested and adopted as the ‘correct? maximally rewarding 
response. 

Trial and error learning does seem to be a reasonably good model 
for this process, then. Can it be regarded as conditioning in any sense? 
After experience with the procedure it seems doubtful. The conditioning 
paradigm is a simple one. When one stimulus is associated with another 
which of its nature elicits a response, the response to it becomes at- 
tached to the associated stimulus which previously did not elicit this 
response. What in probability learning would we regard as the uncon- 
ditional stimulus? The response to be conditioned is the true frequency. 
Can this frequency, in the stimulus schedule, be said to elicit the match- 
ing response in the same way as, for example, the puff of air elicits an 
eyeblink in the eyeblink conditioning procedure? It would seem that it 
cannot and if the matching response were so regarded what would 
one regard as the conditioned stimulus? Could the lights themselves 
be regarded as the conditioned stimulus? No, because within the 
terms of the investigation the lights can never be in any context, 
other than that of some frequency of occurrence ratio to which sub- 
jects will adapt. In other words the conditioned stimulus is inseparable 
from the stimulus which elicits the unconditional, to-be-conditioned 
response. 

The best way of viewing the procedure, from the point of view of the 
general aims of the investigation then seems to be, not as a true condi- 
tioning process but as an associative or learning process in which a 
response tendency is built up and modified to meet a changed condition. 
It would not have been surprising, if had positive results been reported 
in the cases ofthe two previously described investigations, this technique 
would have produced a different result. Since all threcinvestigations were 
inconclusive one can hardly compare them. On the whole, however, 
one would have expected the extravert group to be relatively less effi- 
cient in their learning here also and hence one could, quite legitimately, 
investigate personality differences in this type of process and, if the 
data were at hand, could quite justifiably look at the various inter- 
relationships between the processes and the personality parameters. 
The results that were in fact obtained, however, are too fragmentary to 
base any conclusion upon even if the process is regarded in this way an 


not as true conditioning. 
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SPATIAL ‘CONDITIONING’ 


The technique employed in this investigation was suggested by work 
carried out in the study of synaesthesia.! The phenomena of synaes- 
thesia refers to the association of the characteristic response of one 
sense modality with a stimulus usually relevant to another modality. 
This study has led to an:examination of how one type of stimulus (saya 
tone) affects the perception of another. That an effect of this type is 
produced? suggested the procedure to be described. Essentially, this 
procedure attempted to cause subjects to associate the location of a 
tone in a semi-circular horizontal plane with the position of a light in 
the same plane. 

Again, the procedure was initially conceived as an example of con- 
ditioning and it was expected that measures on the performance task 
would be related to personality parameters estimated from question- 
naires. More particularly it was expected that any association would be 
less marked for the subjects with high scores on extraversion. 


Apparatus 
The apparatus consisted of a semi-cylindrical booth whose long axis 
was in the vertical plane. The booth was constructed of metal supports 
joining two semi-circular wooden members which formed the top and 
the base of the booth. The whole framework was covered with a white 
blanket material which was reinforced on the top by heavy felt. A scale 
marked on a strip of heavy drawing paper eight inches wide was glued 
horizontally around the inside of the booth so as to be at eye level for 
a sitting subject. A second and identical scale was fixed behind the first 
for the use of the experimenter. The scale was marked with zero at the 
centre and with fifteen divisions by fives, to the left and to the right. 
The positions to one side were marked in green and those to the other 
in red so that one half of the scale read from zero at the centre to 75 
green at the periphery and the other read from zero to 75 red at me 
periphery. At each end of the scale was a ‘pearl’ 25-volt lamp enclose 
in a shade which gave a diffused light of low intensity inside the booth. 
Two arms were fixed at the locus of the top semi-circular member, 
one above the other. These led across the top of the booth and then 
bent down, so that their ends described a semi-circle, parallel with the 
central points of the scales. On the end of one arm was a point light 


source and on the other a sound source. Both arms projected back from 
i 6, 
1 Langfels, H. S., ‘Synaesthesia’, Psychol. Bull., 1926, 23, 599-602 and 1929, 26, 
582-5; Kelly, E. Lowall, ‘An iepr io produce artificial axem by mo 
technique of the conditional response’, J. Exper. Psychol., 1937, 1m3 js jim bs, 
S. H. ‚The experimental development of colour tone synaesthesia’, J. T dà 1 7 
1944, 99, 660-7; London, I. D., ‘Research on sensory interaction in the Soviet 
Union', Psychol. Bull., 1954, 51, 531-68. 


1 Howells, 1944. 5 
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their locus a short distance in front of the booth where the experimenter 
stood. On each of these projections was a pointer which moved around 
a semi-circular scale divided in the same way as the scale seen by the 
subject, so that by placing the point to a corresponding portion on the 
investigator’s scale the stimulus could be delivered at any required point 
on the subject’s scale. In the mid-point of each division on the subject’s 
scale a round hole was cut in the paper so that the blanket material 
showed through. When the light was switched on it appeared as a 
diffuse spot in the centre of one of these holes. 

The travelling light source was a directed six-volt lamp. The travelling 
tone source was a telephone ear-piece being stimulated by a small pure- 
tone generator located behind the booth. The tone was set at 1,000 
C.P.S. and at 50 db. on the tone generator meter—this gave a tone to the 
subject of quite audible but low amplitude. The experimenter was pro- 
vided with a ‘lectern’ bearing the tone and light control switches and a 
board for recording the subject’s responses. Except for the low intensity 
lamps on the booth the room was in darkness. The room in which the 
investigation was carried out was sound-proofed and was lined on the 
inside with sound absorbent board. 


Procedure 

The subject had previously been given a screening test for auditory 
defects and had been interviewed concerning any physical disabilities 
he suffered. Because of the screening procedure involved in admission 
to the apprentice training school from whence the subjects were taken, 
very few subjects had to be rejected on these grounds. On introduction 
into the test room the subject was seated in a small chair facing the 
inside of the booth and the subject’s scale. Care was taken to see that 
the subject’s head was approximately beneath the locus of the scale. 
The task was then explained to the subject in the following terms. 

‘I want you to tell me the direction from which a number of tones 
come. The tone will sound at any of the points about this scale in front 
of you. You will notice that the scale goes from zero to 75 green and 
from zero to 75 red. The tone will always come from one of the positions 
marked, never from anywhere else and the directions will be more oT 
less evenly distributed about the thirty positions on the scale. I want 
you to tell me each time there is a tone where it comes from by saying 
the position; for example it might be 25 green, 65 red and so on.” 

The subject was then given several practice tones and asked to identify 
the direction of each one. If he was in error he was corrected for these 
trials but never again at any other time. When it was clear that the 
subject understood the instructions, series A was administered. This 
consisted of twenty-five trials of identifying the direction of the tone. 
At this stage no light was employed. Series A served the double pur- 
pose of giving a base line indication of the subject’s ability to make 
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the judgements and to acquaint him thoroughly with the task before the 
crucial part of the investigation began. During this session the subject 
was instructed to keep his head still while the tone was sounding and to 
watch the zero point in front of him. The tone delivered throughout was 
a double one. Two short pips separated by about half a second were 
found more convenient than one pip. The first pip served to attract the 
subject's attention and*generally to orient him towards the required 
direction. Most subjects reported that they based their judgement on 
the second ‘pip’. 

When series A was completed the subject was instructed: 

‘From now on a white light will shine through the holes around the 
scale. Just as the tone, the light may appear anywhere from zero to 
75 green or 75 red. Whenever you see the light I want you to turn your 
head towards it and to look at it just as you have been looking at the 
zero. Keep your head still as before, listen for the tone and then tell me 
where you think the tone has come from. Your task is just the same as 
before—to tell me where the tone has come from. Since the light may 
appear at any of the points about the scale you will have to be scanning 
round to see it. As soon as you do, keep your head still and watch it.’ 
The subject was then given several trials with the light alone so as to 
enable him to recognize it. . 

From this point three more series constituted the main part of the 
procedure. Series B consisted of twenty-five trials in which both the 


positions of the light and of the tone paired with it, were decided upon 


separately, by reference to a table of random numbers. Thus each varied 
f the other. The posi- 


randomly about the scale and was independent o l r 
tion of the light seen by the subject bore no regular relationship to the 


direction of the tone which followed it. Series C followed series B with- 
out any interruption and the two were continuous as far as the subject 
was concerned. In Series C, twenty-five trials were given, where the 
light and tone were delivered from the same position on the scale. 
Again these positions were determined by reference to a table of random 
numbers. Dispersed amongst these twenty trials were fivein which tone 
and light were separated as in Series B. The thirty trials of series C thus 
formed a schedule of partial reinforcement. Series D followed series C 
also without interruption and again the two were continuous from the 
subject's point of view. Series D was the same as Series B in that light 
and tone occurred independently and randomly about the scale. — 

The whole procedure occupied about half an hour. No time limit 
was set to the subject’s judgements but he was encouraged to make quick 
decisions by pointing out that his immediate impression of direction was 
probably the most accurate. As soon as one response was given the 
pointers of the stimulus arms were set again and a new pair was given. 
In all cases where the light was employed a period of one second elapsed 
after the subject had seen the light before the tone was sounded. 
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The procedure yielded a great deal of data concerning the accuracy 
of judgements in this type of task and the influence of the light on these 
judgements when the latter were compared to those in which the 
light was not employed. However, to restrict the available information 
to the scope of this section we will deal only with the change in judge- 
ments from the series B to the series D. 

It was believed that series C would act like a:conditioning procedure, 
that because the light preceded the tone and because the two always 
occurred together the position of the light would come to signify the 
direction of the tone. Series B and D were necessary to test the effect of 
this ‘conditioning’ series. It was felt that the effect would be indicated 
by an increased tendency for errors in judging the direction of the tone 
in series D to be towards the light, wherever the light might be. It was 
also expected that this tendency would be greater for the group of 
introverts. 

As in the case of salivary conditioning it seemed here a wise precau- 
tion to carry out a control study to determine whether the effect of the 
light on the position of the tone altered with the passage of time simply 
by virtue of the number of judgements made. Accordingly twenty sub- 
jects were put through the schedule with a series randomly independent 
substituted for series C. Here, no change in the relationship of the 
light to the tone was made from series B, to series C or from the 


latter to series D. 


Results 

As was intimated, this study yielded a good deal of information which 
will not be included here. Two scores were taken as indicating the 
‘conditioning’ effect of series C. They will be referred to as D-B and 
No. of +. The first was obtained by calling a positive error, any error in 
judging the direction of the tone which placed the tone closer to the 
light than it was in fact and, a negative error, one which displaced the 
tone away from the light. The algebraic sum of these errors was then 
computed for series B and D, for each subject and the group mean of 
sum’s difference between the two calculated. This group mean was 
+79-49 + 128-82. With eighty cases it yielded a ‘£?’ value significant at 
better than the 1% level (¢ = 5:5) suggesting that the sum of the 
+ errors in series D was indeed larger than that of series B. 

This seemed to suggest that the procedure was working as was hoped. 
Nevertheless, it was felt wise to make the same comparison of B and D 
when no pairing of light with tone intervened. Accordingly the twenty 
subject control study was carried out. This time, instead of the sum 0 
all the errors in D being positive and those of B being negative as was 
the case in the main study, both sums were negative, but that of B 
was the larger quantity, so that the value of D-B was positive for this 
study also. The group mean difference was here + 147:25 + 88:13 
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Pearson product moment correlations between * D-B' and No. of + and personality parameters 


| DB | No.of +| HY Pd Por aot Pt | K D N 
| | | 
| i 
40. D-B | 40531 | +0002 | —oox | —o16 4.0063 | —0-054 | —0-085 | —0163 | —0-035 
41. No. of + | —0413 | —0-125 | —0250*| —0-182 | +0096 | —0403 | —0222*| -+0077 
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which again yields a ‘t’ value, significant at better than the 1% level 
(t = 7-4). 

The same result was found when the measure taken was the No. of +. 
This score was obtained simply by counting the number of + errors in 
each of B and D, i.e. the number of displacements towards the light, 
and by subtracting the number in B from the number in D. By taking 
account only of the direction in this way it was hoped to obviate the 
effect of the occasional very large error. In the main study the group 
mean difference between these measures was +1-08 + 3-19. With eighty 
cases this mean difference was significant at the 1% level (t = 3:00), 
again suggesting that more + errors were made in D than in B. 

When the same comparison was made in the control study the group 
mean difference was 4- :4 + 3:41. These values yield a ‘t’ ratio which 
is just on the borderline of significance at the 5% level (t = 1-8). 

Table 3.7 shows how these two scores, D-B and No. of +, related 
to the personality variables. 

Table 3.7, in which the principal predictions are tested, shows that 
only No. of + and E of MPI and No. of + and D yielded significant 
correlation coefficients. 


Discussion 


The results of the control study, as in the case of salivary conditioning, 
renders the meaning of the main study ambiguous. If the results of the 
control study are to be taken at their face value and, since the subjects 
were drawn from the same population and the procedure identical, 
other than the controlled variable, there seems no reason why they 
should not, then it appears that our procedure measured not a condi- 
tioning process but the effect of prolonged exposure to the task; the 
more often the light and tone were temporally associated the more the 
light appeared to affect the perception of the direction of the tone irre- 
spective of how the two were related spatially. 

The effect seems to us now to be more complex than the original 
conception of spatial conditioning and would seem to be another in- 
stance of the general principle of responding in a situation characterized 
by scant cues, already noted in connection with verbal conditioning. 
Other things being equal, this principle appears to assert, events that 
are associated in time are associated in other ways and the more fre- 
quently they are associated in time, again other things being equal, the 
more closely they are associated in other ways. In our study the effect 
was, in fact, rather stronger in the main study where the light and tone 
were paired both temporally and spatially but it was still strong, even 
in the absence of spatial association. 

It is believed that the negative sums of errors were obtained through 
the overpowering influence of some very large errors which, by virtue 
of the positions of the tone and light in some trials, were bound to be 
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negative. These errors were associated with trials in both series B and 
D and at least one instance of such an error occurred with about a third 
of the subjects. The explanation for these errors might lie in echo effects 
in the experimental room or might have had something to do with the 
positioning of the subject. Since no conclusive evidence pointed to these 
being errors of any different kind to the others it was felt they had to be 
included. By and large they approximately compensated for each other 
but, in our terminology they made the sums of errors appear negative 
except in series D of the main study. It was despite this artifact that the 
effect of the light on the tone was evident also in the control study, a 
result suggestive of its strength. 

If, in fact, subjects do tend to associate in other ways, stimuli which 
are only temporally associated when cues are reduced to a minimum, 
the process could be regarded as reflecting an existing established con- 
ditioned response. In the general experience of the developing organism 
when one thing follows another closely in time, they frequently are 
associated in other ways; one aspect of the sight of a mother, for 
example, and the sound of her voice, is a purely temporal association. 
Many other things are coincident in time and are not associated but 
such sheer coincidences come to be ignored as the result of a discrimina- 
tion process, one might suppose. In the absence of other cues to effect 
discrimination, to respond to one stimulus that always precedes another 
as if they were the same stimulus, is clearly consistent with a condition- 
ing paradigm. The missing element is that of reinforcement. In our 
investigation there is no ostensible reinforcement. By displacing the 
tone towards the light the subject gets no reward, avoids no punishment 
or, as far as one can see, achieves no end at all. If conditioning had 
occurred by virtue of the spatial pairing alone one would have had an 
example of conditioning by virtue of contiguity (Guthrie, 1952). If our 
Procedure is simply reflecting a general conditioned response, built-in 
during development, then one cannot say whether reinforcement has 
Occurred or not. In general one would have expected that reinforcement 
would have played its part in the establishment of this response. 

Turning now to the contents of Table 3.7 one must again report a 
general lack of sizeable coefficients between the ; 
personality variables. Even on one's revised concept of the process, sig- 
nificant relationships would still be expected. However, only No. of + 
Shows any significant correlations and these are difficult to interpret. 
One coefficient was predicted, that with the E scale of the MPI but the 
other is with the lie scale of the CPF and these two personality variables 
are virtually uncorrelated in this study (r = —0-013). If the coefficients 
are reproducible ones then the process must be related to some factor, 
Other than extraversion, related to the D scale. This is not neuroticism 
as neither E nor D are correlated with N in this study. 

The possibilities of this procedure for examining what could be the 
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reflections of a very general conditioned response established during 
early development seem impressive to the writer, although the experi- 
ence of this investigation has indicated that improvements in apparatus 
and technique are called for. Even the nature of the general response, 
if such there be, is doubtful. It would be quite consistent with such facts 
as there are, if one were to suggest that the true state of affairs was one 
essentially of discrimination learning. Such refiections serve to empha- 
size how far one has to go before the relationship of even the simplest 
laboratory response to general qualities of behaviour, is understood. 


GENERAL CONCLUSIONS AND SUMMARY 


In the four investigations reported in this section, the relationship 
between four associative procedures and measures of the personality 
dimensions Introversion/Extraversion and Neuroticism has been ex- 
amined. Of the four procedures, eyeblink conditioning, salivary condi- 
tioning, verbal ‘conditioning’ and spatial ‘conditioning’, the latter two 
did not seem to possess the special properties of conditioning proce- 
dures, at least in the form in which they were originally conceived. After 
closer acquaintance with these two procedures it was decided that verbal 
‘conditioning’ (probability learning) could best be conceived as a type 
of trial and error learning and that spatial ‘conditioning’ seemed, not 
to involve the establishment of a new conditioned response but to be 
eliciting existing conditioned responses established during development. 
Of the four procedures only eyeblink conditioning and salivary condi- 
tioning possessed the properties of conditioning procedures. Despite 
encouraging results from a pilot study, control studies cast considerable 
doubt on whether, over the whole group, the performance phenomena 
under examination in salivary conditioning and spatial ‘conditioning’ 
were obtained. 

In the case of salivary conditioning a nearly significant difference 
between the amount of salivation to the conditional stimuli and the 
‘control’ stimuli was obtained in the control study, which difference was 
in the direction of indicating that the conditional stimuli elicited more 
salivation in the absence of any reinforcing schedule. Essentially the 
same result was obtained in the control study of spatial conditioning 
although this could be interpreted as indicating the operation of the 
same tendency to respond as was suggested for the main study. We have 
characterized this tendency as one in which events associated only in 
time, in the absence of cues to the contrary, are responded to as if they 
were associated in other ways. 

Bearing these complications in mind, the relationship of various 'con- 
ditioning’ measures to a set of personality questionnaire measures was 
examined in the form of Pearson-product moment correlation coeffi- 
cients. The personality scales used were: HY (hysteria scale of MMPI); 
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Pd (Psychopathic Deviation scale of MMPI); E (Extraversion scale of 
the Maudsley-Personality Inventory); C (of Cattels’ Contact Personality 
Factor Inventory); Pt (Psyaesthenia scale of the MMPI); K (The Faking 
good/Faking bad scale of the MMPI); D (the ‘lie’ scale of the Contact 
Personality Factor Inventory) and N (the Neuroticism scale of the 
Maudsley Personality Inventory). The scales of the MMPI had been 
shown to be related to extraversion and neuroticism in a previous study 
by Franks, Souief and Maxwell (1959). The intercorrelations of the 
questionnaire measures in this study did not bear this earlier work out 
in every detail. 

The following significant correlations were found between the per- 
formance measures and the personality scales: 


Salivary Conditioning 
(T — C) 

(a) X N 

Ius and control situation)—with K, r — — 0:319. 

(b) No. of + (number of times salivation to conditioned stimulus 
exceeded control measure irrespective of amount)—with Pt, 
r = + 193, with K, r = — 0:349, with D, r = — 0222. 


, It was concluded, on examining these relationships, that, if any of the 
indicated results were reproducible, neuroticism and salivary measures 
Were positively correlated. In the absence of any corroborative evidence 
andin the presence of contradictory facts, however, itseemed mostlikely 
that these correlations would not prove reproducible and, ín view of 
the control study results, did not, in all probability, involve a con- 
ditioning measure. 


(mean differencein salivation between conditioned stimu- 


Verbal Conditioning 


(a) Change in left choices (gross change 
made from first schedule with the 0-5 ratio 
a ratio of 7:13 lefts to rights)—with Pt, r= 
pes 0-21. 

(b) Variability (sum of squared deviations of no. of left r 
Over seven blocks of twenty trials from corresponding points on straight 
line of best fit to these seven points)—with HY, r = -- 0-218, with Pd, 
r= + 0-220. 

Because the correlations with Pt and N were such as to suggest a 
négative relationship between change and the scales, it was suggested 
thata rigidity factor was being measured, rather than one of neuroticism. 

cause Hy and Pd do not correlate, in this study with extraversion or 


Fears i measure, the meaning of 
nor with any other performance , the 
4 i es remains obscure. 


the relationship be E hese seal 
tween variability and these r 
On the evidence of Franks, Souief and Maxwell (1959) it suggests that 
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the more extraverted subjects were more variable. This relationship was 
also obtained in the pilot study. 


Spatial Conditioning 

(a) No of + (count of the difference between the number of error dis- 
placements of tone towards a light before and after a series of light 
and tone being spatially paired)—with E«(of MPI), r = —0:250: 
with D, r = —0:222). 


Again, since E and D are uncorrelated it is a little difficult to see what 
their relationship with the spatial conditioning measure could mean. The 
negative correlation with E was predicted but as it stands alone we must 
reserve judgement until this correlation is reproduced in another study. 


Eyeblink Conditioning 

No personality scale in this study correlated significantly with eyeblink 
conditioning. It has been shown that when normal subjects have been 
employed this result is consistent with three other similar studies in 
which no significant correlations are reported. The exception is a study 
by Franks in which a significant correlation of —0-46 was reported 
between extraversion (E of MPI) and eyeblink conditioning in a group 
of 57 students. 

A z transformation was performed on the coefficients and the esti- 
mated true correlation between eyeblink conditioning and extraversion 
proved to be a low negative correlation significant for the number of 
cases being considered. 

Reasons for low values of the obtained coefficients have been dis- 
cussed, particularly in regard to unknown environmental conditions 
and the inadequacy of the principal questionnaire scales, the Maudsley 
Personality Inventory. The former factors are judged of possible but 
unknown importance, but evidence is brought to bear on the latter 
contention which supports the suggestion made that the E scale of 
the Maudsley Personality Inventory is a poor criterion in differentiating 
groups of introverts from groups of extraverts. 

At the level of analysis attempted in this study, then, our conclu- 
sions must be negative. Taken as separate studies the four procedures 
did not reveal performance scores which significantly correlated with 
measures of extraversion or of neuroticism in any consistent and mean- 
ingful way. The relationships that were found seemed to be of the type 
that could have been thrown up by chance in a large matrix, at best 
their meaning was obscure. However, that this study does not conclu- 
sively settle the issue of whether conditioning measures correlate with 
extraversion and neuroticism, is clear when one considers that only 
eyeblink conditioning and salivary conditioning possessed the proper- 
ties of conditioning procedures, in the form they were employed and of 
these, conditioning was only indisputably achieved in the former. 
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PLATE V. Verbal Conditioning Apparatus. Subject on left, experimenter on right. 


acces ee 


PLATE vi. Spatial Conditioning Apparatus. 
Showing chair for subject, subject’s scale and 
experimentee's setting scale. 


PLATE VII. 


Spatial Conditioning Apparatus. 

Rear view showing moving tone and light 

sources and duplicate of subjects scale for 

checking the position of moving stimuli 
sources, 


a x ES 


PLATE vill. Figural After-effect Experiment showing position of subject in the centre, with Com- 
parison of Circles Test on his right and Brightness Comparison Test on his left. 


PLATE Ix. Subject’s view of tests. Extreme right: mirror reflecting Comparison of Circles Test; 
extreme left: projection screen of Brightness Comparison Test. Centre left: Rotating Spiral 
Test; centre right: Dark Vision Test. 
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MEASURES OF PERCEPTUAL 
FUNCTIONS 


H. C. Holland 


Introduction 


SHOULD it prove possible to establish experimentally a general feature 
of several perceptual tasks which is related to a relatively well defined 
personality dimension, three things would have been accomplished: 
(a) it would extend the scope of the theory of personality from which 
the existence of the dimension can be derived; (b) it would permit a 
rapprochement between perceptual task performance, and performance 
in those other sectors of behaviour to which the theory has relevance; 
(c) it would permit ‘personality’ to be used as an integrative concept in 
Psychology accounting for the differing degrees of generality found in 
the behaviour of an individual (or type of individual) from one situation 
to another. 
The feasibility of relationships between perception and personality 
Stems from several sources. (I) In the first place, the relationship can 
be inferred from the establishment of genetic determinants in both per- 
ception and personality (McLeod, 1954). Few modern psychologists 
would deny that such a relationship exists although they may disagree 
about the contribution made by endowment in either case. (mA second 
Source of evidence is the large number of investigations which have 
outlined differences between clinical groups, and perceptual abnormal- 
ities of a psychogenic origin. (See for example, Eysenck, Granger and 
Brengelmann, 1957.) (III) There exist à group of theories aimed at 
the explanation of changes in performance consequent upon ign 
Stimulation and response, Pavlov (1927), Hull (1943), Eysenck (1953) an 
Kohler (1944). These are generally referred to as ‘inhibition’ theories, 
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and are based upon the proposition that a positive force exists which 
acts against evocation and which is due to either preceding work or 
excitation. (IV) Finally, there is the direct experimental evidence and 
general characterizations of perceptual task components (Thurstone, 
1943, Eysenck et al., 1957) represented by ‘factors’ which in themselves 
reveal nothing of the causal relationship until predicted by a causal 
hypothesis. d 

The most important of the sources outlined is the theoretical one. 
Without theory there can be no causation and only the existence of 
a theory allows ‘causal’ statements to be made. The theory within 
which the facts of the present study will be interpreted is a polygamous 
association between Pavlov, Hull, Kohler and Eysenck, the theory of 
the latter having the widest application in the field of personality study 
and being a development and modification of the theories of Pavlov and 
Hull. Kohler's theory, on the other hand, has been concerned mainly 
with changes in perception which follow continued stimulation; al- 
though its formal and functional identity with similar concepts in 
learning and personality have been outlined (Eysenck, 1955) and (Dun- 
can, 1956). Hence differences in the inhibitory substratum which 15 
postulated by Eysenck to account for differences in behaviour are applic- 
able to the perceptual phenomena outlined by Kohler. - 

According to Eysenck's theory differences in the excitation/inhibition 
balance of the mechanisms subserving behaviour may be assessed on the 
behavioural level by introversion/extraversion, and a continuum (or 
dimension) corresponding to this personality variable is consonant Wit 
a continuum of 'inhibition'; the extravert being high on inhibition and 
low on excitation and the introvert being essentially the opposite. Unlike 
Eysenck, Kohler has not been primarily concerned with individual 
differences but has postulated that constant stimulation of a sensory 
surface leads to a state of localized fatigue (inhibition) in the mediating 
mechanisms of the cortex which has a measurable effect on subsequent 
stimuli presented in and near the same arca (Kohler, 1944). Nevertheless, 
Kohler's initial experimental results show a wide range of individual 
differences and the phenomenon of figural after-effect has been employ’ ed 
quite widely as a measure of inhibition in several pathological and 
normal groups (vide infra). 

Now it follows from the postulated identity between other inhibitory 
effects and satiation and their relation to extraversion, that differences 
in satiation must be paralleled in some measure by differences in extra- 
version. Indeed Eysenck, arguing from the same premise, has asserted 
that satiation effects should appear earlier, they should be stronger an 
that they should disappear more slowly in a hysteric group of subjects 
Eysenck, 1955): where the hysteric represents an extreme group for 
extraversion. 

The present study makes essentially the same prediction as Eysenck 
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concerning extraversion in a normal (unhospitalized) group of young 
males, although several of the perceptual tasks assigned involve changes 
which are not clearly accountable within the Kohler Wallach hypothesis. 
The relevance of satiation theory to each test will be discussed in each 


case. 


A battery of six tests was employed, each giving a measure which is a 
function of the existence or build-up of inhibition. Of these six, two 
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the obtained results. The first examination takes the simple form of the 
description of the distribution and the comparison of two selected 
groups assumed to represent high and low estraversion groups because 
of their scores on the Maudsley Personality Inventory. Secondly, there 
is an examination of the correlational aspects of the investigation which 
outlines the relationships between the test scores and the questionnaire 
criteria as well as the more important correlations between different 
experimental variables. The factorial analysis of the intercorrelations 
is dealt with later on. (P. 234; cf. also Fig. 4.1.) 

Several derived scores have been used in both of the examinations 
referred to and it may be useful at this juncture to designate each score 
and what it represents. 


(1)! 1. Dark adaptation. The score, in minutes, taken by the subject 
to achieve four correct discriminations at the level noted in 
the text. 

(22) 2. Auditory flutter fusion. Y. The threshold of frequency dis- 
crimination. 

Q3) 3. Auditory flutter fusion. YI. The differential threshold for fre- 
quency discrimination. Defined by the difference between 0% 
and 100% ‘correct’ responses. 

Q9) 4. Spiral after-effect (Height).? The maximum after-effect dura- 

tion. This may be defined as; height; a = y, + Ya + ys + Ya 
+ ys and is the average value of y calculated as a/5. 

(30) 5. Spiral after-effect (Slope). Gives the net increase or decrease 

in y from the first to the fifth reading. It thus defines the slope 

or the line connecting y, with Ys- Slope therefore = b, 
=r cJ 

Spiral after effect (Distance). Defines 

€ = {402 + ys + Ys) + 6b — 125.) 
V16 4- b? 

and represents the perpendicular distance of the intermediate 

points, yo, y; and y, from the line of the slope: positive when 

they are above the line and negative below. 

Fixation. The sum of the two one-minute trials, i.e. the total 

number of times the subject breaks fixation. 

(55) 8. Figural after-effect (Contour). The difference between the pre 
and post satiation judgements on ascending trials. 

(56) 9. Figural after-effect (Contour). The difference between the pre 
and post satiation judgements on descending trials. 


1 The numbers contained in brackets are those by which the tests are denoted in 
the factor analysis in the section dealing with this aspect of the investigation. 

3 This method of summarizing the spiral after-effect scores was suggested by 
Dr. Patrick Slater to whom acknowledgement is made for both this and other 
statistical advice in the production of this report. 
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(57) 10. Spiral after-effect. The sum of two one hundred seconds trials 
of prior stimulation. 

(58) 11. Figural after-effect (Brightness). The difference between the 
first and the third pre, and the fifth and the seventh post 
satiation judgements. 

(59) 12. Figural after-effects (Brightness). The difference between second 
and fourth pre, and the sixth and the eighth post satiation 
judgements. 

(45) 13. Intelligence (IQ). The raw scores on group test 90A of the 
National Institute of Industrial Psychology. 

(46) 14. MMPI. The raw score on scale Hy. 

(47) 15. MMPI. The raw score on scale Pt. 

(48) 16. MMPI. The raw score on scale Pd. 

(49) 17. MMPI. The raw score on scale K. 

(50) 18. Cattel’s (CPF). The mean of the ‘E’ scale scores on both forms 
of the CPF. 

(51) 19. Cattell’s (CPF). The mean of the *D' scale scores on both 
forms of the CPF. 

(52) 20. Maudsley Personality Inventory. 
(E?) scale. 

(53) 21. Maudsley Personality Inventory. Th 
(‘N’) scale. 


The score on the Extraversion 


e score on the Neuroticism 


Subjects 


The total data available are for 97 subjects, divided into two groups of 
17 and 80; the smaller constituting the experimental group for the pilot 
study which preceded the larger. The first sample were all male graduate 
or undergraduate members of the University of London reading for a 
diploma in Education. They were all unaware of the purposes of the 
experiment and were paid for their services. Their ages ranged from 
twenty to forty (Mean 29:17, S.D. 6:28). The present tests took o 
one hour and thirty minutes of a seven-hour testing programme. The 
second group of cighty were all young industrial and student apprentices. 
They were again all males with ages ranging between sixteen and "S 
teen. All were employed by the Ford Motor Company who dopat 
their time; the testing programme being carried out on a normal working 
day. This sample, therefore, received no direct payment for their services, 
except for meals and expenses. 

Differences between the two groups cannot be ruled out. ae scone 
group is younger, probably less intelligent, and there may ae e 
slight difference in motivation. The actual values quoted in the tex ae 
for the larger population. The smaller group scores have, where app! 
Priate, been appended in parenthesis. 
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The test situation 


All the tests to be reported were conducted in the same room, which 
was 9:33 x 10-25 ft. square. It was 8-33 ft. high and thus had a capacity 
of 796:83 cubic feet. Within the limits of permissible expenditure the 
room was prepared for the present investigation. Both windows were 
shuttered and light tight, and walls and ceiling painted matt grey to 
prevent the reflection of extraneous light from instruments. The edges 
of the doors were sealed and the doorway was overhung by a heavy 
curtain. The corridor outside the room was carpeted to prevent exces- 
sive noise and a red light warned passers-by that an experiment was in 
progress. The room was ventilated by an electric fan which maintained 
a continuous level of sound, and heated by hot water pipes. Both 
temperature and humidity were kept at a relatively constant level 
throughout the investigation. 

All tests but one (AFF) were so arranged that they could be per- 
formed from the same position (see Plates VI and VII). Once the subject 
was seated in the combined chair and head rest he was not disturbed 
until the battery was finished. (He could, of course, relax from any 
position of strain during the rest periods between tests.) 


Questionnaires 


Several questionnaire measures and an intelligence test score were 
taken as part of the present investigation. 

MPI (Maudsley Personality Inventory). This questionnaire has been 
used extensively as a measure 
of its standardization and 
Eysenck (1956), Eysenck 
The two scales which have 
The inventory contains 80 
ticism; there is a lie scale 
items are highly correlated 
lated with others, maximizi 


Cattell's CPF (Contact 
of extraversion. Essential 
ring to the degree of ‘soci, 
to find it personally satisfs 
are two forms of the CPF, Forms A and B. 
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sonality test requires no elaboration here. he interested rea i 
Ei ae to Hathaway and McKinley DD de the uses eter 
d the inventory.) Four scales of the test were used, namely, Hy, Pd, 
; and K, representing neurotic extraversion (Hy), Psychopathy (Pd), 
neurotic introversion (Pt), and attitude to test items (K). 
Unfortunately the MMPI scales, unlike the MPI, are not factorially 
on The work of Comzey (1958) at the University of California has 
own that each of the scales used in the present study may be com- 
posed of as many as fifteen factors and that many identified factors are 
common to several scales. 
ms measure of intelligence available for the industrial group was the 
IIP Test 90A. This is a group test of verbal intelligence which is ad- 
d to all apprentices of the training school. It comprises four 
e (1) same and opposite; (2) Analogies; (3) Jumbled sentences; (4) 
e completion. There are forty items in the first three scales and 
Fidis een in the last. Raw scores only were used, although the test 
ie vs an adjustment for age. As indicated already, there was no 
ntelligence quotient available for the University population. 


Results 

The means and standard deviations of all the questionnaire scores 
are given in Table 4.1. These indicate that on the MPI the experimental 
sample is slightly more extraverted than the general population when 


TABLE 4.1 


Questionnaire Scores 


Means and standard deviations of scales on MMPI, MPI, CPF, and the 


IQ measure, NIIP Group Test 90A, N = 80 


Mean S.D. 
MMPI Scale Hy 16808 — 382 
8 Pt 9667 622 
5 Pa 12256 3771 
5 K 1509 42^ 
CPF (both forms) ^» E 37:968 4-626 
5 D 6448 2172 
MPI " E 29295 836 
5 N 2098 1012 
IQ Raw Score 82269 13706 


compared with existing normative data (Jensen, 1958), but not more so 
The sample is almost at the 


than normal American University students. K 
mean on the neuroticism variable, the mean being derived from a large 
population from which the sample was drawn. No strict comparison 
exists for such a homogeneous sample on the CPF and the MMPI 
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although the fact that the first standard deviation on either side of the 
CPF mean span ‘ranks’ 5, 6, and 7 support the MPI scores in indicat- 
ing that the sample is slightly more extraverted than a normal group 
(Cattell, 1954). 


Dark Adaptation 


A necessary control which has to be exercised in visual experiments 
of the kind dealt with in this chapter, is the state of adaptation of the 
retino/cortical mechanisms. This is particularly the case in figural after- 
effects and other measures of satiation. Kohler and his associates have 
argued that satiation is not a transient effect and that a certain amount 
depends on the permanent state of the (visual) cortex. This residual 
satiation they call ‘permanent’ or ‘statistical’, and it is caused by the 
nearly constant impingement of visual stimuli upon the receptors. It is 
a state which will fluctuate moderately as sleep and unusual environ- 
mental conditions raise or lower the average intensity of figural currents. 
Attempts at validating the concept of statistical satiation have not met 
with any great success, e.g. when differences between early morning and 
late night measures are taken (cf. also Heron, 1956), but it is consistent 
with the notion of differential conductivity as postulated by Klein and 
Kretch (1952). 

It is postulated that where resistance is high, figure currents will be 
weaker, and hence polarization by such current will be less. It would, 
therefore, follow that the individual in possession of a high level of 
permanent satiation should produce less effect from the imposition of 
a standard stimulus, which would represent merely an increment added 
to the general level, and any displacement of a subsequent stimulus 
should be less profound. 

The influence of statistical satiation can probably be investigated by 
adequate manipulation of the environment but where this is not pos- 
sible, it can be controlled, at least to some extent, by trying to ensure 


that all subjects come to the test in the same state of adaptation. The 
condition can, of course, onl 


0 n y be a rough approximation because of 
differential rates of adaptation (see Granger (1953), for a good discus- 
sion of individual differences in dark vision), but if intensities are set 
which take only short periods and thus do not capitalize on such differ- 
ences, the control will be of some value. The time taken to achieve dis- 
crimination at a constant intensity may also prove a valuable measure 
inits own right, particularly in a case such as this where all subjects were 
seated in à constant room lighting for some 25-30 minutes before dark 
adaptation was begun, and thus came to the test in rough equality of 
light adaptation. 


A test of dark adaptation, even of the limited type described, in the 
present battery serves two purposes. In the first place it permits some 
control of an experimental variable which will effect satiation processes 
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in iui 

mid ud = goed it will act as a measure of extraversion and 

Toadings e imensions having been shown to have high factor 

Pe is function (Himmelweit, 1946). Because of its multi- 

dens mposition the test cannot be expected to correlate very signi- 
ly with either of the questionnaire dimensions. 


Apparatus x 
The instrument used in this test is one which w: i 
u as specially constructed 
rg a (Gravely, 1950). It was an inis Research 
com: ry Adaptometer Mk. la, covering a wide range of illumination 
B Wo epe 3-961 x 1077 to 7:847 x 107? e.q. f.c. by means of 
The 6 A n cn source, three filters, and sixteen graded apertures. 
selon .C. supply was controlled by rheostat and voltmeter. The 
inire cie has a uniformly illuminated 51-in. screen (va. — 45?) 
i ene : jy a sector which may be rotated into eight positions. The 
fà S in this investigation was 10-4 x 0:60 candles/sq. ft. 
the nose point was attached to the apparatus just above (14 in.) 
position mp screen and controlled by rheostat to give the minimum 
The m t. The stimulus thus fell in the lower part of the visual field. 
disi ug ject was required to fixate the red light during the period of 
Dus ees and to report, as soon as he was able, the position of 
nolit or. A successful response led to a change of the sector to another 
ie on. Changes wererandom and fourcorrect responses Were required 
re the test was considered completed. 


Results 


a Pg of dark adaptation scores h 
a stand a inme: The mean of the distrib 
same t ard deviation of 1-705 minutes. For the sake of completeness the 
also Wo extreme groups which are compared on the other tests were 
con: compared on this measure, although dark adaptation cannot be 
i sidered as a simple measure of inhibition. Scores for the two groups 
ere as follows: 
NE. E:4:509, S.D. = 1:833, high E: 4213, S.D. = 1:673. Sec Fig. 4.2 
Th e total and selected groups distributions. . 
diffe ese means and standard deviations are essentially the same, the 
S aee between them (0-296) being insignificant. — mu 
"s. s was expected, dark adaptation is not correlated ith the criterion 
si i of extraversion. There are, however, several interesting and 
iud b cant correlations with other test variables, i.e. height (r = — 0-208) 
D oth thresholds of the contoured figural after-effect (Asc. r = 0:259) 
esc. r = 0-242). 
tion and the comparison of 


The two correlations between dark adapta 
with the theory advanced by 


duals who take considerable 


as shown a wide range from 
ution is 3:858 minutes with 


" 
ca scores are, of course, consistent 
er and Wallach. That is to say, indivi 
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: ination le 
periods to adapt to an initially sub-threshold constant ene by 
might be said to be dissipating inhibition which has been que scores 
preceding stimulation. It can only be assumed that a. developed 
represent the relative ease with which satiation weer been gen- 
or the relative persistence of such potentials once they week to decay 
erated. Conditions of both ease of generation and resistan 


vel 


DISTRIBUTION OF DARK 
ADAPTATION SCORES N=80 


SELECTED HIGH E GROUP 
N=23 


dH ele, 5 


SELECTED LOW E GROUP 
N-23 


q-Qüms[*pnp.g. 


S ic] * 3 S 
TIME IN DARK IÑ MINUTES 
FIG. 4.2. 


70 


> 
LI 


of the effects of stimulation might le dt arge and 
persistent figural after-effects. : SESIMGERIENGRGR p 
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Spiral After-effect 

The physiological basis of the after-sensation of movement, following 
the fixation of a rotating spiral or striped drum, is unknown. All that 
can reasonably be said of it is that it is one of the several illusory effects 
which are subsumed under the general heading of after-sensations of 
apparent movement. Several of the factors which influence the occur- 
rence and persistence of the apparent movement have been delineated, 
and it is known and employed quite extensively because of its simplicity 
of experimental management and its reliability and scope for re- 
search into individual differences with both normal and pathological 
individuals. 

_ The history of apparent movement certainly antedates psychological 
literature and probably antedates written history (Wohlgemuth, 1911). 
In more recent times many well-known figures in psychology have in- 
vestigated apparent movement (Purkinje, Adams, Muller, Helmholtz, 
Wundt). These investigators used mainly the waterfall" illusion and it 
is to Plateau (1850) that is accorded the construction of the first arith- 
metic spiral; although his primary interest lay in the colour effects which 
can be induced by rotation of the black and white disc. 

In the present century the earliest systematic investigation of the 
Spiral phenomenon was conducted by Wohlgemuth in 1911. Outlining 
the existing theories, which were based on a multitude of different con- 
cepts from the 'passage of after-images across the retina’ to the ‘incor- 
rect interpretation of sensations’, Wohlgemuth embarked upon a series 
of some thirty-four experiments in order to determine the relative effect 
of contributing variables. Unfortunately, the conclusions which he drew 
from his investigations have not the clarity of his experiments. Indeed, 
he postulates a complex network of cortical and subcortical centres, 
suggests that the spiral after-effect is similar to the spinal reflex, and 
postulates a ‘centre of movement’, to account for the phenomenon. 
Despite theoretical weaknesses the empirical results of Wohlgemuth’s 
work were most valuable in outlining the importance of such variables 
as brightness, direction, velocity, and area of retina stimulated, and 
he made the useful suggestion that the after-effect might be employed 
in the investigation of the ‘optic tract’. 

Since Wohlgemuth's experiments the spiral phenomenon has been 
studied extensively by Hunter (1914 and 1915); Thalman (1921); Granit 
(1928); Grindley (1930); Gates (1934); Deutsch (1950); Walls (1953); 
Holland (1957,1958) and Eysenck and Holland (1957), each in turn being 
interested in a slightly different aspect. These experiments have done 
little to clarify the nature of the phenomenon, although certain basic 
parameters are seen to emerge. For instance, little doubt can now be 
held concerning the central nature of the after-effect (Holland, 1957), 
(Walls 1953), cf. however, Hunter (1914). Another important parameter 
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is the visual angle subtended. The optimal retinal size appears to be 
in the region of 2-4° va. 0° pa. (although it appears that this magni- 
tude may be increased without any significant shortening of the effect 
(Holland, 1958), but it does not seem that this is a rod-cone effect as was 
previously held (Granit, 1928; Grindley, 1930). Within a wide range the 
speed of rotation of either spiral or drum appears to have little or any 
effect so long as the subject is able to discriminate the spiral from its 
background. Indeed, if Deutsch's 1950 results can be repeated (and the 
present author has failed twice under slightly different conditions) objec- 
tive rotation of the spiral disc may not be a necessary condition of the 
after-effect. Deutsch found that intermittent illumination of the sta- 
tionary disc not only gave the illusion of rotation but also produced the 
characteristic after-sensation of movement. (In a private communica- 
tion Deutsch reports that this effect has also been obtained by Grindley.) 
The after-effect occurs when the gaze (post-rotation) is projected upon 
parts of the environment other than the inducing or a similar spiral. 
Whether the effect occurs independently of a structured stimulus pat- 
tern is in some doubt. Holland (1957) has reported that no effect was 
noted by a small group of subjects when their post-rotation fixation 
was in a dark grey homogeneous background with low illumination. 
Grindley and Wilkinson (1953), however, report that an effect was ob- 
served when post-rotational fixation was on a white ivorine disc. Fixa- 
tion seems to be important, despite early studies which showed the con- 
trary (Thalman, 1921). Holland (1957) reports that if the fixation point 
is changed randomly every five or six seconds, no after-effect follows 
long periods of stimulation. Finally, the most important source of vari- 
ation in the after-effect is the time through which the initial stimula- 
tion is continued (Thalman, 1921; Holland, 1958). The growth in the 
persistence of the after-effect is, however, not linear with time and 
several studies by the writer indicate that persistence scores tend to fall 
off after stimulation periods in the range 80-100 seconds (Holland, 1958). 
Because of the wide range of differences in after-effect persistence the 
spiral has been employed as a method of reliably assessing individual 
differences. Perhaps the first systematic investigation in this field is the 
now famous study by Thurstone (1943). The test (No. 24 in his battery) 
was unfortunately dropped from his final analysis although it correlated 
with other perceptual tests like flicker fusion, necker cube, and size 
constancy measures. The present author has attempted to use spiral per- 
sistence as a measure of differences in the excitation-inhibition balance 
which, on Eysenck’s theory (Eysenck, 1955), is the neurological charac- 
teristic subserving the differences between introverts and extraverts 
(Holland and Beech, 1958), Adducing evidence from the relationship 
between inhibitory effects, their increase with depressant drugs and 
decrease with stimulants, Eysenck and Holland (1957) have shown that 
the after-effect is modifiable in accordance with the extraversion-drug 
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hypothesis, i.e. that both extraverts and those ‘extraverted’ by sodium 
amytal, have shorter after-effects than introverts or those ‘introverted’ 
by excitants. 

Several recent studies have attempted to employ the existence or 
duration of the spiral after-effect as an index of either memory impair- 
ment or ‘organicity’. This has proved largely unsuccessful in the study 
of memory (Freeman aad Josey, 1949; Standlee, 1953) but, despite 
methodological weaknesses, somewhat useful in the diagnosis of brain 
damage, Bender and Teuber (1947, 1948, 1949); Price and Deabler 
(1955); Gallese (1956); Page et al. (1957); Holland and Beech (1958) and 
Goldberg and Smith (1958). The results of these last named studies have 
indicated that organic patients perceive the spiral after-effect less 
frequently and for shorter periods than normals; cf. however, Philbrick 
(1959) and Johnson et al. (1959). - 

According to Eysenck (1956) and Shapiro (1953), the characteristic 
differences shown by brain damaged subjects, when compared with 
normals, are due to an intensification of inhibitory effects; Shapiro pre- 
ferring the Pavlovian concept of negative induction (the irradiation and 
concentration of inhibition) and Eysenck a modified Hullian concept of 
reactive inhibition. Predictively these two notions have many similarities. 
Indeed, it follows from the theory of personality advanced by the latter 
author (Eysenck) that brain damaged subjects and neurotic extraverts 
(hysterics and psychopaths) should have many behavioural similarities 
(Eysenck, 1955), and thus many of the predictions which may be made 
of brain damaged patients may also be made of extreme extraverts and 
vice versa. This is a proposition which has been tested and in some 
measure confirmed by the work of Petrie (1952, 1953 and 1953b). Now 
it is clear from the established association between brain damage and 
spiral duration, and between brain damage and extraversion, that the 
duration of after-effect in a normal population will be proportional to 
the degree of inhibitory potential with which individuals are endowed 
and which may be assessed on the behavioural level by their degree of 
extraversion. At the test level this proposition results in the specific test 
hypothesis: The duration of the after-sensation of movement following 
stimulation by a rotating spiral will be negatively correlated with the degree 
of extraversion. 


Apparatus (Spiral) 

The spiral and its driving mechanis 
where Holand, 1956) = Holland and Beech (1958). (See Eysenck 
(1957) for a photograph of the apparatus and disc.) It was a ge d 
180? logarithmic type which is sometimes called the aay piral. P 
was 81 in. in diameter and from the viewing distance subtended an ang 
at the retina of 6:7°. Rotation was supplied by an electric motor con- 
trolled through a rheostat. Speed of rotation is maintained at a constant 
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level by reference to a built-in strobe-disc and neon mounted on the 
drive shaft at the rear of the disc. This method permitted a standard 
rate of 100 r.p.m. to be maintained within about 1%. Illumination was 
given by two sources of 60 watts A.C. placed 12 in. below the centre 
point of the disc. The light from the two sources being contained in 
tubular housings, which did not allow light to leak backwards in the 
direction of the subject, and projected it forwerds as a slightly eliptoid 
area a little bigger than the disc. The spiral was held on the rotating 
shaft by a chromium-headed screw which also served as a fixation 
point. 


Procedure 


The subject was seated in a combined chair and chin rest. He had 
been given, before testing began, a demonstration of the after-effect in 
the following terms: ‘This is a demonstration of a test you will be doing 
later. I want you to keep your chin on the chin rest and your eyes fixed 
on this small spot you see shining in the centre of the spiral.’ The spiral 
was then switched ‘on’ and after about 15 seconds: ‘Don’t move your 
eyes . . . in a few seconds I am going to stop the disc from rotating. 
When I do you will have an after-effect. Don’t take your eyes from the 
centre of the spiral and you will see that the after-effect continues for 
some time. All I want you to do is to tell me, by saying “stopped”, as 
soon as the effect finishes. Don’t take your eyes off the centre of the 
spiral until I tell you to do so.’ At this point the spiral apparatus was 
switched ‘off’ and the subject reported the end of the effect. The last 
injunction of the instructions was added because several subjects in the 
preliminary studies had been ‘fooled’ by the very rapid contraction in 
the early moments following the cessation of the objective rotation and 
had failed to observe the slower contraction which follows it. 

The subject was then asked to describe the phenomenon and the 
experimenter ascertained that he was in fact perceiving the characteristic 
after-effect. There was never any difficulty in this respect. 


The experiment 


In the experiment proper the subject was in darkness adapting to the 


standard level following the Comparison of Circles Test. During this 
period the following instructions were administered: ‘You will remem- 
ber the demonstration, given earlier, in which I asked you to report 
when the after-effect of contraction finished. . . . This is the test proper. 
It is exactly like the demonstration, Remember to keep your eyes fixed 
on the centre of the spiral both during and after the objective rotation 
and tell me as soon as all movement on the surface of the disc com- 
pletely stops.’ 
At this point the motor was switched ‘on’ and when the strobe disc 
indicated the correct number of r.p.m. the lights in front of the apparatus 
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were switched ‘on’ simultaneously with a stop-watch. At the end of a 
hundred seconds the machine was stopped and the subject reported the 
cessation of the effect. This was the first trial and it was followed by a 
three-minute rest interval; again in complete darkness. The rest period 
was followed by a second identical trial. 

The second part of the spiral experiment came at almost the end of 
the battery, immediately following the brightness figural after-effect. 
The only difference between the trials of the second part and those of 
the first lay in the fact that stimulation times were not held constant 
but increased progressively. Thus, the duration of the after-effect was 
assessed after 15, 30, 50, 80 and 100 seconds of stimulation. The rest 
periods were also different, and this time only one and a half minutes 
were allowed instead of three minutes. 

Two types of measurements were taken of the after-effect persistence. 
The first was based upon the two trials of 100 seconds prior rotation. 
The second five trials gave measures which were taken after 15, 30, 50, 
80 and 100 secs. of rotation. This latter method stemmed from the 
results of previous experiments which had indicated that a plateau of 
persistence time was achieved as prior rotation periods were increased 
and that the rate of acquisition to this plateau might represent an indi- 
vidual or personality variable. From these five measures, therefore, 
there was extracted three derived scores which were labelled height, 
slope and distance. 

These derived scores are defined as follows. Y1, Y2, Y3, Y4 and Y5 
represent the recorded measurements for a subject on the five trials. 
The five scores were then converted into: (a) height, a — Y1 + Y2 
+ Y3 + Y4 + Y5/5, representing the average value of Y; (b) slope, 
b = Y1 — Y5 and gives the net increase or decrease in Y from the 
first reading to the fifth and thus defines the slope of the line connecting 
Y1 with Y5; (c) distance, c = (&Y2--Y3 +-Y4)+-6b—12y1)/V 16--b* 
and defines the total perpendicular distance of the intermediate points 
Y2, Y3 and Y4 from the line, it is thus a measure of curvature of the 


growth function from the slope. 


Results 

The first of the spiral scores are the two trials to assess the after- 
sensation persistence following 100 seconds of prior rotation under ye! 
conditions previously described. The first trial results in a men a 
24-013 secs. (S.D. = 8-045) and the second trial had a mean of 2 os 
secs. (S.D. = 7:836). The inter-trial reliability of the test was r = o3 n 
For correlational purposes the two trial measures were combined to 


give a mean of 24-179 secs. (S.D. — 7:699). The pilot study sample had 


a mean of 26-01. ER 
i are presented in T: able 4:2 which indi- 
The second set of spiral scores are pi pues dn GRE ii 


cates the increase in persistence L to t 
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preceding stimulation. (The pilot study group gave means of: 15 secs.: 
15-4; 30 secs.: 19:6; 50 secs.: 19-6; 80 secs.: 21-7; 100 secs.: 21:7.) 


TABLE 4.2 


Spiral Scores . 
Table showing the growth of persistence of the after-effect with increasing periods 
of prior stimulation. Figures in the body of the table are means and standard 
deviations for the total group: N = 80 


Prior After- 
stimulation effect 
(seconds) (seconds) S.D. 
15 15:154 5:871 
30 18:679 6:210 
50 19-846 57116 
80 20:000 5-800 
100 20-744 6:658 


The final measures are the derived scores of height, slope and distance, 
seen in Table 4.3. 


TABLE 4.3 


Spiral Scores 
Table of derived scores height, slope, and distance. Figures in the body of 
the table are the means and standard deviations for the total group: N = 80 


Mean S.D. 
Height 18-886 5-370 
Slope —5:590 5-253 
Distance 2:817 5-745 


gain that there is a gradual increase in per- 
me and that the small positive score represent- 


ing the curvature of the acquisition time shows that the curve is slightly 
negatively accelerated, 


TABLE 4.4 


Spiral Scores 
Means and standard deviations of the second five scores for two selected 
STOups representing high and low extraversion 
1 2 3 4 5 
HighE 13-1 16-5 175 1755 17.5 
S.D. 55 65 50 58 58 


LowE 159 20:2 19-6 20-1 20:8 
S.D. 64 68 64 $7 62 
N = 23 for both groups = 28-75% of total group, 
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The extraversion hypothesis has been examined by selecting two 
groups which have high and low scores on the criterion measure of 
extraversion. Both groups number 23 and thus represent the upper and 
lower 29% of the total sample; the high E group having scores above 
36 and the low E group having scores below 25 on the extraversion 
scale of the MPI. The scores of the two groups are given in Table 4.4 
and the difference between the two groups (t = 1-16 p < 0:05 n.s.) is 
also illustrated in Fig. 4.3. See also Fig. 4.4. for the distributions of the 
total and selected groups. 
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FIG. 4.3. Spiral Aft ter-Effect Persistence (Selected Groups). 

The results of the spiral test have tended to confirm previous experi- 
mental results and generally supported the proposition that the persist- 
ence of the after-effect is related to the personality dimension of extra- 
version/introversion as measured by the MMPI. When extreme groups 
in a normal population are opposed, discrimination 1s fairly clear, al- 
though not significant statistically, and there is no overlap in the ipee 
It could be expected that pathological groups, e.g hysterics and brain 
damaged, would be differentiated even more clearly. diede 
Although test/retest can only be assessed on a trial to s ig i 

is very high for this kind of perceptual test, i.e. approximately 7b : 
the variance between trials is accounted for. The derived score height 
also correlates with the first spiral measure to the degree of r = 0:801, 


i ight i r isition rate suggests 
which se height is an average score of the acquisition rate st 
aee bi ? It, of course, also implies that 


that this rate is fairly monotonic. This resu 1 i 
the experiment has not been completely successful in demonstrating the 
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expected plateau of persistence times which had characterized earlier 
work. Fig. 4.3 suggests that a plateau has already started with the high 
E group but that the low E’s may require longer periods still. 

In outlining the correlational aspects of this and subsequent sections 


DISTRIBUTION OF SPIRAL (100 SECS) 
PERSISTENCE SCQRES FOR 


TOTAL GROUP N-80 


SELECTED HIGH E GROUP 
N=23 


SELECTED LOW E GROUP 


N=23 
0-10 ~~ 10-20 ' 20-30 - - 
TIME OF AFTER-EFFECT IN SECONDS en 
FIG. 4.4. 


of the present report, it is not the intention to advance 'strings' of co- 
efficients of every test with every other test. The total matrix of correla- 
tions is presented later on and includes the present tests as well as 
others. All that it is useful to display are the correlations of the specific 
measures with the independent criteria of extraversion, i.e. MPI and 
CPF E scores, and any correlations with other test measures which may 
be of special interest. 

Four measures of the spiral scores have been used for correlational 
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purposes. These are height, slope, distance, and spiral effect during the 
100-sec. trial. The first and the last of these scores are highly correlated, 
i.e. 0-801, and appear to be measuring the same function. They can, 
therefore, be considered together and be representative of the persistence 
of the after-effect in time. The correlation of this function with both 
criterion scores of extraversion is as follows: 


MPI (E) CPF (E) 
(1) height —0-203* —0-205* 
(2) Spiral 100 —0:154 —0-206* 


all areas predicted and three are significant at the 5% level of confidence. 
(For 78 d.f. 5% = 0:187, 1% = 0:264.) 

The correlations of the other scores are not as clear as those in the 
preceding paragraph, i.e. CPF and slope, r = — 0:143; MPI and slope, 
r — 0-069; CPF and distance, r — 0:037; MPI and distance, — 0:083. 
None of them achieve statistical significance and three of them are so 
meagre as to preclude any discussion of them in terms of the hypotheses. 
The fourth CPF and slope suggests that the rate of acquisition of the 
spiral effect is not related to extraversion, although this conclusion can 
only be tentative. 

Of particular interest are the correlations of height and the second 
spiral persistence score (i.e. spiral 100) with other questionnaire scales, 
specifically with the Pd scale ofthe MMPI r — —0:210 and, —0-263 and, 
in the former case, the K scale of the same questionnaire, height and 
Kr = —0-207. The failure of the measures to display a significant rela- 
tionship with the Hy scale (neurotic extraversion) Hy and height, 
r = —0-095; Hy and spiral 100 r = —0-147, cannot be accounted for, 
unless the reason lies in the multifactorial composition of the scale as 
already outlined. 

Of particular interest also is the set of correlations between the two 
spiral persistence scores and the contoured figural after-effect. They 
are positively correlated and three of the four coefficients are significant, 
ie. height and figural after-effect = 0-213 and 0-261; spiral 100 and 
figural after-effect = 0-163 and 0-252. This suggests, as is pointed out in 
a subsequent section, that the magnitude of the figural after-effect by 
the present method of elicitation, is not a simple measure of extraver- 
Sion, and, like the spiral, is a negative correlate of ‘E’. 


Fixation 

The importance of fixation as a variable in experiments on visual 
figural after-effects has been emphasized by many studies (Nichols, 
1956; Eysenck, 1957; Summerfield, 1955; and Heron, 1956). Yet it is 
Strange that no investigator has reported on the various measures of 
after-effect in relation to accuracy of fixation, using accuracy in its 
Widest sense to imply maintenance as well as excellence. Too often in 
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psychological experiments, either implicitly or explicitly, the *co-opera- 
tiveness’ of the subject is stressed and from this the naive assumption 
made that because the subjects are instructed to keep their eyes on a 
certain spot (or fixation point) the effectiveness of the stimuli they 
receive is equal (cf. Barlow 1952; Ditchburn and Ginsberg, 1952; 1953; 
Westheimer and Conover, 1954). The vast body of knowledge concerned 
with individual differences in perception suggests that such an assump- 
tion cannot be made and that the relative ability to fixate may be a con- 
tributing factor to such differences (Holland and Beech, 1958; Ratliff 
and Riggs, 1950). (It is true that in a random sample of the population, 
fixation, as a variable, might be normally distributed, but where differ- 
ent personality groups are chosen or pathological conditions opposed, 
it may assume considerable importance.) 

As Osgood (1953) has pointed out, the fundamental fallacy concern- 
ing fixation is the notion ‘I fixate (look at) something intentionally by an 
act of will’, for although gross hunting movements of the eyes may be 
voluntary, the precise alignments which guarantee single vision are due 
to involuntary mechanisms. Hence volition can play only a small part. 

The simple neurological model for fixation by which distributions of 
excitation representing the disparate images overlap until, reaching a 
critical point, they summate with lateral inhibition erasing secondary 
contours, is confounded by perceptual and attitudinal factors, e.g. 
familiar figures ‘fuse’ more readily, as do figures having ‘good’ form, 
after fusion these figures also resist diplopia. Blinking? also involves a 
break in fixation for not only is vision occluded by the eyelid but, due 
to Bell’s phenomenon, the eyes are moved (Bergman et al., 1952; Bruell 
and Albee, 1955; Ponder and Kennedy, 1927; Hoffman, 1946). Both 
eye movements and blinking have been studied after the induction of 
depressant drug (Bergman et al., 1952; Nathanson et zl., 1953), which 
is of special interest here because of the extraversion/drug hypothesis. 
Under the influence ofthe drugs both convergence and divergence of the 
eyes is altered and in some of the cases reported subjects were able to 
fixate with one eye only. Indeed, in some of the extreme cases the drug 
produced both vertical and lateral nystagmus, and other signs of cortical 
and subcortical sites of action. Any test result which might indicate 
when the stimulus, in relation to its retinal and central location, was 
moved, or when it was occluded by blinking, would be a useful addi- 
tion of information in accounting for individual differences in percep- 
tual phenomena of an inhibitory type. 


The Spiral as a Test of Fixation 


A recent observation by the author gave grounds for the belief that 
the spiral might be used as a simple test of eye movements. Under nor- 


* Blinking has also been employed as a measure of visual efficiency. The interested 
reader is referred to the followin; 


reports: Lucki ; Bi 1936, 
1845) and oe ea n g rep uckiesh (1937, 1946); Bitterman ( 
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mal conditions, when the spiral is being used as a method of inducing 
apparent movement, it is rotated at speeds varying between 50 and 
150 r.p.m. These speeds are necessary because the after-effect tends to 
break down beyond them. It has been observed, however, that when the 
spiral was rotated within the region of 600 r.p.m. and viewed from dis- 
tances between 34 to 5 ft., then eye movements whether involuntary or 
voluntary, or whether due to part of the blink process, produce on the 
uniform greyness of the disc a ‘flash’ in which part of the spiral emerged 
and could be seen clearly for a fraction of a second. The impression 
gained is as if a part of the spiral were seen as standing still or, in some 
cases, as travelling at a very much slower rate. The effect also occurs 
when deliberate blinks are made. 

The frequency of the reported ‘flashes’ varies from about 3-50 (the 
vast majority being between 5-35), over a one-minute period, but any 
one person is reasonably reliable from trial to trial. 

The following explanation of this phenomenon is tentatively sug- 
gested; although it is unlikely that at this stage of development any 
statement could be comprehensive. If the spiral is rotated at a relatively 
high speed, but one which will permit discrimination between spiral and 
background, it quickly loses the appearance of a spiral turning outward 
or inward on itself and appears as almost concentric ‘pulses’ moving 
from centre to periphery. Now, if it is assumed that the same process 
Continues, as the speed increases, beyond the discrimination level, then 
any change in the fixation point (located at the centre) may momentarily 
Synchronize with the ‘expanding’ line of the spiral. This stroboscopic 
effect is only likely to occur when eye movements are relatively rapid 
and not when they are slow; which may account for the fact that aimless 
and slow movements do not necessarily produce the phenomenon. 

It is rather difficult to describe the characteristics of the ‘flash’ but 
many subjects have said that something like ‘a black line’ appears, or 
that it ‘becomes more black and white’ or ‘it seems to stand still for a 
moment’. It seems as if, for a very small moment of time, a certain part 
of the spiral is suspended in its movement. Not all of the ‘flashes’ are 
in the same magnitude or in the same place which suggests a differential 
direction and amplitude of eye movements. A simple and analogous 
phenomenon is found when a tuning fork is set to vibrate at a frequency 
above the discrimination threshold and is moved across the field of 
vision. The movements of the eye following the fork, being largely bal- 
listic in nature, make rapid jumps which overtake the fork. During the 
Co-incident movements of eyes and fork the prongs are occasionally seen 
as Stationary. It will be appreciated that with the expanding spiral such 
Co-incident movement is made necessary by the fact that any movement 
away from the central fixation point, if its speed and amplitude are great 
enough, must ‘overtake’, or be itself ‘overtaken’ by a part of the spiral 
as it unwinds. 
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Preliminary experiments using the spiral as a test of fixation have been 
carried out. So far they have shown large individual differences with 
good reliability. In one experiment high correlations (rank order) have 
been determined between performances on the spiral and reversal figures 
on the Necker Cube Test, although the relationship is not one to one 
which militates against a simple hypothesis of reversal being ‘triggered 
off’ by eye movements (Eysenck et al, 1956). Another set of experimental 
results from a preliminary study had shown a clear correlation between 
persistence of after-effect duration and eye movements as measured by 
the spiral test, ie. low persistence times went with high number of 
‘flashes’. 

Any explanation of the relationship between poor fixation and low 
persistence scores is complicated by the yet unexplained nature of the 
spiral phenomenon and until this doubt is resolved and the effect sub- 
sumed under some theoretical generalization, only the empirical data 
are relevant. However, if the after sensation of movement is properly 
accountable within a satiation-type hypothesis, the importance of fixa- 
tion is clear. (The writer has proposed that a possible explanation of 
the poorer performance of brain damaged subjects on the spiral test (see 
previous section) may be due to the known distractability of this group. 
As distraction in visual perception is likely to involve fixation (as well 


as attentional) efficiency, the possible contribution of its influence must 
be considered.) 


Procedure 


To perform this test the subject was seated in his combined chair and 
chin rest. In his right hand he held a very lightly spring-loaded push 
button through which circuit to a counter was made or broken. The 
spiral dimensions, etc., were as for the spiral test. A demonstration was 
given until the subject was well aware of the phenomenon he was to 
report and afterwards a practice trial was undertaken. Both demonstra- 
tion and practice occurred immediately after the demonstration and 
practice of the spiral after-effect. The subject was then asked to keep 
the phenomenon in mind until required. 

In the test proper, the method called for two one-minute trials with 
a three-minute interval. 


Results 


The mean number of eye movements assessed by the present method 
appears to be very stable over trials separated by only a few minutes. 
This is shown by the trial means of 20:321; S.D. = 10-158, and 21:218, 
S.D. = 10-182 and the inter-trial correlation of 0-937 (Test mean for 
pilot group = 18:8). On the grounds of these scores there can be no 

1 Indeed, in one experiment it i i i i ere 
changed every five ME while bees sli Eds pies gent 
no effect, ora very much reduced effect, followed considerable periods of stimulation. 
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doubt concerning the reliability of this test of fixation although its 
validity cannot be adequately assessed. Content validity would appear 
to be high because of the logic of the test situation and because itis hard 
to imagine what the test is measuring so reliably if it is not eye move- 
ments. The predictive validity of the test must be considered low be- 
cause of its failure to correlate highly with other tests which might be 
affected by eye movements. Furthermore, when the extreme groups 
which have been alluded to previously are compared, the measure fails 
to discriminate them at an acceptable level of confidence; i.e. Low E, 
18:717, S.D. = 8:57; high E, 21-087, S.D. = 11-71, a difference of 2:370 
(t = 0-766, p n.s.). 

The examination of the correlation of this test with the criterion 
measures and other visual tasks indicates its failure to behave in the pre- 
dicted direction, and support other experimental indications that it is a 
correlate of extraversion and has a considerable part to play in visual 
tasks requiring steady and maintained fixation. 

The two correlations of this variable with the contoured figural after- 
effect scores, i.e. —0-006 and —0-101, and those between fixation and 
the main spiral scores of height and spiral 100 of 0-004 and 0:012 pre- 
clude the possibility of any conclusion. The only other correlations of 
interest are between fixation scores and some of the questionnaire 
measures. For example, the Distortion ‘D’ scale of the CPF (r = —0:238), 
the Neuroticism scale of the MPI (r = 0:168), the K, Pd, Pt and Hy 
scales of the MMPI r = —0:110, —0-067, 0-178 and —0-133. As several 
of these scales have a neuroticism component, it is possible that differen- 
tial eye movements are due in some way to the neuroticism factor. 


Figural after-effects 
Experience and experiment teach that sensory intensities vary under 
conditions of continued stimulation. At the simplest level, this is 
measured as differences between, or progressive changes in, absolute 
and difference thresholds for stimulation. That a process similar to this 
Sensory adaptation is operative in more complex experience is sug- 
gested by a group of findings culled from many diverse areas of psycho- 
logy. Called severally, adaptation with negative after-effect, figural after- 
effect, etc., these discoveries indicate that adaptation is a function of 
stimulus intensity and differential adaptation in a stimulus array due to 
differential excitation by preceding intensities. Now, as the effects of 
stimulation persist after the removal of the eliciting stimulus, it fol- 
lows that a constant stimulus which impinges upon, or crosses, an area 
which has been subjected to greater excitation than an adjacent area 
will have less effect in sensation, than in an area of lessened adaptation. 
In other words, sensory thresholds being a function of the degree of 
adaptation will be higher in areas of previously strong stimulation. 

A great number of early psychophysical experiments have outlined 
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the characteristics of sensory adaptation of the simpler kind and gener- 
alizations about its temporal course and other features have been laid 
down. In the more complex situations, observed in the distortion of con- 
tours and the subjective normalization of objectively distorted percepts, 
two main theories have been advanced to explain the phenomenon. 
The first (Gibson, 1937 and 1937b, 1954; Gibson and Waddell, 1952) 
postulate sensory adaptation with negative after-effect as the initial 
cause of distortion and argue that normalization is due to the possession 
of the perceiver of certain psychological norms or acquired standards 
of, say, verticality. If, according to this view, a stimulus is presented 
which deviates from the norm, adaptation operates to make it like 
the norm, and subsequent stimuli in the same area are distorted as 
the complementary of this adaptation. The second theory, and prob- 
ably the more important, has been advanced to explain a great many 
perceptual effects in several modalities. Originally suggested by Kohler 
and Wallach (1944) to account for phenomenal displacements in visual 
perception, it required a sequence of two stimuli known as the inspec- 
tion or satiating figure and the test figure. When presented in the correct 
temporal and spatial relationship, the contours of the test figure are 
distorted in a predictable way by the inspection figure. This distortion 
or displacement is called a figural after-effect. 

„The underlying mechanism of figural after-effects is satiation, a con- 
dition of the mediating neurones akin to a localized fatigue effect in 
the central projections of sensory surfaces. One of the basic assumptions 
of the theory is that a stimulus figure is iso-morphically projected on 
the cortex setting up, coincident with its contours, a state or condition 
of increased resistance to subsequent stimuli in the same area. This 
anelectrotonus lowers the polarizability of the affected membrane and 
any subsequent figure current in the same area is deflected away from 
the satiated region and into an area of less resistance or higher response 
potential. The resulting perceptual changes are referred to as figural 
after-effects. Satiation as such has the primary logical status of an inter- 
vening variable in that no orthodox neurophysiology will cater for it ade- 
quately. The original work of Kohler and Wallach contains an extensive 
discussion of the electrophysiological and chemical substratum which are 
postulated as subserving the distortions which have been referred to. 

Although of primary interest in visual perception, where many para- 
meters of satiation have been established, e.g. temporal aspects by 
"— (PD). Ikeda 1953a, 1953b), Oyama (1953),? Parducci (1956), 

8 ); Universality by Marks (1949), George (1953a, 19530); 


1 ] : à à 

; Gibson's experiments are given primacy here but it is clear that he was aware 
of the work of Stratton on adaptation to reversed fields which must be considered 
as the forerunner of normalization and similar experiments 


? For recent, and some of the most intensive, work on figural after-effects in Japan 
the reader is referred to the review by Sagara and Oyama (1957). 
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Retinal size by Sutherland (1954), Prentice (1947, 1950) and brightness 
by Kohler (1951) and Freeburne and Hamilton (1949), the theory has 
been successfully applied to other modalities and across modalities 
(Jaffe, 1956). 

In auditory phenomena an effect analogous to the classical visual 

effect has been described by several authors (Deutsch, 1951; Jones and 
Bunting, 1949; Krauskopf, 1954; Christman, 1954; Nichols, 1956) and 
kinesthesis by (Jaffe, 1954; Klein and Kretch, 1952; Eysenck, 1955; 
Nichols, 1956 and Nachmias, 1953). In autokinetic movement Crutch- 
field and Edward (1949) have demonstrated the effects of pre-satiation, 
Carlson (1953) and Hochberg (1950) have shown that prior satiation 
influences the direction of reversible figures, Deatherage and Bitterman 
(1952), Shapiro (1954) and Brenner (1953), have outlined its influence 
in apparent movement, and Kohler and Fishback (1950a, 1950b) and 
Kohler and Wallach (1944) have maintained that the disappearance of 
the Muller-Lyer illusion upon repeated trials may be accounted for by 
differential satiation (but cf. Eysenck, 1958). The crossmodal study re- 
ferred to above is that of Jaffe who has shown that visual fixation of 
different widths of paper exerts an influence upon kinesthetic judge- 
ments. This finding of Jaffe’s is in line with work being conducted in 
Russia according to Professor Luria of Moscow.* 
_ It has been repeatedly observed that there are individual differences 
in the components of figural after-effects, e.g. magnitude, ease of elici- 
tation, and resistance to decay. Few studies, however, have deliberately 
investigated these differences. Exceptions are the work of Klein and 
Kretch (1952), in kinesthesis who have modified the original satiation 
theory to a proposition of the basal level and state of cortical con- 
ductivity and have postulated, and demonstrated, differences between 
brain damaged and normals. Almost simultaneously with the Klein 
and Kretch investigation a study by Jaffe (1954), which had several 
weaknesses, failed to discriminate between a brain damaged and normal 
sample. 

Using the method and apparatus of Klein and Kretch’s experiment, 
Eysenck (1955) made the same prediction of the hysteric as the previous 
authors had made of the organics, These predictions followed directly 
from his theory of personality and empirical evidence of the many simi- 
larities in the behaviour of these types. The results of the study supported 
the conclusion that kinesthetic figural after-effects in the hysteric group 
appear more quickly, are more strongly marked, and disappear more 
Slowly than in the dysthymics. Eysenck’s findings on this were supported 
by Nichols although his very large battery of satiational indicants which 
Included tests of the Muller-Lyer illusion, reversal of perspective, nega- 
tive afi ter-image duration, apparent movement, and figural after-effects 

* This information came from a discussion with Professor Luria on a recent visit 
to London. 
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in auditory and visual modalities, failed on the whole to differentiate 
between his groups of extraverted and introverted neurotics. Nichols 
second prediction, i.e. that there would be a significantly high positive 
correlation between the tests mentioned within cach group also failed. 

Individual differences in figural after-effects have also been investi- 
gated by Wertheimer (1954a, 1954b, 1955a and 1955b). In a series of 
experiments he has adopted a somewhat different line from previous 
authors in this field. Beginning from the premise that differences in 
figural after-effect must represent differences in the modifiability of cor- 
tical conductivity he argues that such differences must be related to the 
general metabolic efficiency of the individual. He has not only been able 
to establish this assertion (a) by demonstrating the relationship of FAE 
and metabolic drugs, but also (b) that schizophrenics, who have low 
metabolic efficiency, also have small FAE. 

Such a comprehensive theory of brain action in perception has, of 
course, not gone unscathed nor uncriticized. Smith (1948) has posed 
several questions which satiation theory cannot as yet answer. Hebb 
(1949) has criticized the anatomical model necessitated by the pheno- 
menon, while, on the neurophysiological level, Lashley et al. (1951) has 
conducted experiments to show that cerebral integration is not due to 
molar patterns of electrical stimulation. Finally, Luchins and Luchins 
(1953a and 1953b) have remarked on the inadequacy of the Kohler 
Wallach theory to account for test figure, and the pattern of which it 
forms part, i.e. their subjects obeyed the principle of Pragnanz instead 
of demonstrating the Kohler effect. 

An alternative to satiation has been presented in the form of the 
Statistical Theory (Osgood and Heyer, 1952; Marshall and Talbot, 
1940). Based on the notion of excitatory ‘peaking’ in the higher centres 
due to the different cellular ratio between retina and visual cortex and 
physiological nystagmus, statistical theory is able to account most 0 
the visual phenomena reported, and possesses the advantage ofan ortho- 
dox neurophysiology. Although adequate to explain visual figural after- 
effects it is difficult to envisage the application of statistical theory in 
other sensory modalities, or even to some visual phenomena (Eysenck 
and Holland, 1958), see also Smith (1952). 

In the present experiment the predictions follow from the same 
rationale as already outlined for the Archimedes spiral effect, i.e. 
through the formal and functional identity of satiation processes an 
Ii inhibition, extraverts should show larger and more intense 

igural after-effects than Introverts, and in a normal population these 
measures should be proportional to the degree of extraversion assesse 
by questionnaire. 
b: is, of course, necessary to qualify the prediction in the light of 
What is known about the importance of the equality of the stimulation 
received by the subjects. Eysenck D has already asserted that differ- 
18 


Measures of Perceptual Functions 


ential eye movement and the ability to fixate, attention in its widest 
sense, and the time through which stimulation is continued, will interact 
to make the figural after-effect a complicated measure. 

In this investigation an attempt has been made to obtain a measure 
of eye movements and the time factor has been controlled by using a 
constant period. There is, of course, no way of knowing in advance an 
individual’s response to such a time interval. 

The preceding prediction was examined in the present investigation 
on two measures of figural after-effect. These, for the purposes of the 
present experiment have been called the ‘contoured figural after-effect’ 
or Comparison of circles test, and the ‘Brightness figural after-effect 
or Douple D. From both may be obtained scores which are measures 
of the magnitude of the influence of the satiating stimulus upon the 
subsequent test stimulus. This is indicated in the first case by the 
apparent contraction of the circle which falls within the contour of 
a satiating figure which is similarly a circle in the same plane and 
retinal location of a slightly larger diameter. In the second case the 
score is a brightness matching error due to the continuous inspection 
of a preceding area of illumination. 

The nature of the tests are clear from the descriptions of the apparatus 
and those of the respective procedures. For the purposes of clarity these 
descriptions have been presented separately for the two tests. The results 
are given jointly because they are predicted by, and thus bear upon, the 
same hypothesis. 


Apparatus 


The comparison of circles test apparatus comprised two VCR 97 
cathode tubes upon each of which had been imposed a circular time 
base. Both tubes had independent size, focusing, and brightness controls. 
Fixation was provided by a leim ruby glass filter over a 75-80 micro 
amp. neon lamp situated midway between the circular tubes. When 
accurately centred at the correct size (6 cm.) the complete visual array 
subtended an angle of 36°. This visual angle was obtained by having 
the apparatus reflected in a front surfaced mirror placed in a position 
8’ 4" from the subject and 6’ 10" from the apparatus. The upper circle 
was standard figure and the lower the variable. Apart from the size and 
brightness controls already noted, controls existed for the lateral posi- 
tioning of the circles in relation to each other. Discrepancies between in- 
Spection and test figures were represented by avoltage difference and read 
off from a voltmeter which permitted accuracy to the nearest millimetre. 


Procedure for comparison of circles test 

The Subject was seated at the side of the apparatus close to the remote 

control for the variable figure. He was told to keep his chin on the chin 

Test and to observe the apparatus which was reflected in the mirror 
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directly in front of him. He was then instructed as follows: 'I want you 
to listen carefully to what I am going to say. In front of you you will see 
a mirror in which is reflected the piece of apparatus we will be dealing 
with next. On it you will see two circles, the lower much smaller than 
the upper. At your right hand you will find a small hand control 
which changes the size of the lower circle. For the time being I want you 
to practise making the two circles equal in size, telling me as soon as 
you have done so.' At this point the subject changed the size of the 
lower circle, practising to get the ‘feel’ of the control. ‘Now then, in that 
practice period you very probably tended to look up and down at the 
two circles to see if they were matched equal. However, in the experi- 
ment which follows in a moment I want you to keep your eyes quite 
still and fixed upon this small red spot (pointing). Do not move your 
eyes when making the two circles equal. You will find that the distance 
from this piece of apparatus to the mirror, and from the mirror to your 
eyes has been carefully calculated to enable you to see both circles 
when you look at the red spot without moving your eyes.’ 

The apparatus was switched ‘off’ at this point, and the subject was 
given another dark adaptation test. As soon as this test was over, the 
first part of the experiment was started. The two circles were switched 
‘on’ and instructions given to make four matches of equality, two 
ascending and two descending. 

The four judgements obtained in this way constituted the four pre- 
satiation PSE’s. The post-satiation PSE’s were obtained as follows: 
‘The second part of this experiment is also quite simple, I shall ask you 
to keep your eye fixed on the red spot in the centre of the apparatus 
for the next four minutes. The top circle will be “on” . . . but I want you 
to ignore it and keep your eyes on the red spot. At the end of the four 
minutes, which I shall call out as they pass, I shall switch “off” the 
top circle and immediately switch both top and bottom circles back 

on" again. When I do that I want you to make four judgements of 
equality in exactly the same way that you did earlier. Is that clear?” 
The upper circle was adjusted to Bive a size of seven centimetres, the 
subject alerted, and the apparatus switched ‘on’, The time was called 
Out as it passed as ‘so many minutes gone so many to go ... please 
keep your eye on the spot'. Thirty seconds before the end of the total 
trial period the subject was asked to put his hand on the control . . . but 
not to take his eyes off the Spot (fixation point). After four minutes the 
top circle was switched ‘off’, the experimenter's size control set to give 
six centimetres, and both circles switched ‘on’ again simultaneously 
with the instruction to make four post-satiation PSE’s. 

The test was done in conditions of almost complete darkness. The 
apparatus was seen by the subject simply as one or two luminous green 
circles with a small, dim, red Spot between them, against a black 
background. 
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Brightness Test 


A second test, which has been called a figural after-effect of bright- 
ness, was included in the test battery. The phenomenon has been called 
a figural after-effect because although being non-contoured as most 
visual satiation effects are, it does not differ from them in conditions of 
presentation or derivation. That is to say, there is no logical difference 
between the fixation of an inspection figure which is contoured or which 
is of relative homogeneous brightness. Both figures should produce 
satiation effects in the central areas mediating their perception and both 
should affect perception of subsequent stimuli in the same area. This is 
a deduction which follows from Kohler's original analysis and from the 
subsequent work of Bevan (1951). 

The test takes the simple form of a circle of light composed of two 
independently variable semi-circles. The subject is asked to make a 
series of judgements of brightness equality by varying one side of the 
circle until it is the brightness set by the experimenter. These (pre- 
satiation) judgements serve as a baseline against which to assess any 
difference between them and a second series of judgements made after 
a period of fixation in which only one half of the field is illuminated. 
Previous work has shown that the second set of judgements of equality 
are usually determined at a lower level than the originals, i.e. as if the 
portion of the visual system which had catered for the continuous stimu- 
lation and against which the standard intensity is projected, is less 
hs il than an adjacent area in which the comparison intensity is 
ocated. 

That these effects in brightness matching are not peripheral is clear 
from the fact that they transfer from eye to eye, i.e. where one eye is 
stimulated and the opposite cye is used to make the judgement. The 
second feature which suggests the classic satiation phenomenon 1s found 
in the fact that they only occur when the usual procedure of inspection- 
test situations are used. (For a discussion of brightness in FAE see 
Kohler and Wallach (1944), Kohler et al. (1952), Hochberg and Triebal 
(1955), Walthall (1946) and Marquart (1954). 

Two important questions may be asked about the present test. (a) Is 
the matching difference due to the characteristic ‘fading’ of the test 
figure as it falls upon an area of the retina cortex which has under-gone 
differential adaptation? and (5) Is the effect a contrast phenomenon? 

An affirmative answer to the first question cannot be proved from 
the present experimental method; although it is highly probable that 
the ‘fading’ effect mentioned by Kohler and subsequent workers has 
Some effect upon brightness discrimination. Indeed, if the phenomenon 
is a figural after-effect, fading would be expected, although in the con- 
toured test fading is said to be independent of displacement. The second 
question can hardly be answered in the affirmative because of the one 

221 


H. C. Holland 


essential difference between this phenomenon and contrast effects, i.e. 
time. Contrast effects usually appear immediately or very quickly in a 
single differential brightness stimulus, whereas the present effect follows 
from considerable periods of fixation and only after the conditions of 
prior inspection before test figure have been fulfilled. It cannot, how- 
ever, be denied that contrast effects, being the product of the degree 
of excitation will exert some influence, but as*Kohler has pointed out, 
increases in satiation will gradually reduce its effect, and they will be 
minimal where satiation is greatest. 

The prediction in this case is essentially the same as that for the other 
form of figural after-effect. 


Apparatus 


The Brightness Test* apparatus consists of a double-beam projection 
lantern having two identical projection systems side by side but deriv- 
ing their illumination from a single common source. Light is collected 
from either side of a slightly over-run 36w. 12v. auto-headlamp bulb. 
Each beam is turned through 90° by identical prisms, to enter duo- 
plano condensers, thereby illuminating evenly the object planes. The ob- 
Jects consist of 3 in high ‘D’ shaped apertures in thin brass plates, each 
adjustable laterally by screws against the counter pressure of springs. 
Double cell achromatic projection lenses focus the two 'D's back to 
back to form a circle of light 12 in. dia. 5 ft. distant, upon a white 
Screen. The brightness of either image may be independently varied 
throughout a wide range by means of slit diaphragms placed between 
the projection lens units. Colouration is indiscernible until refraction 
effects between the knife edges occur at the moment of extinction. 
Mechanical pointers rigidly fixed to the ‘shutter’ crank-shafts and des- 
cribing 5-in. arcs against a glass scale, give fair indications of illumina- 
tion. Each ‘shutter’ consists of a pair of parallel, straight edged blades 
in slotted guides moving in opposition. This movement is effected by 
connecting rods and crankpins set about a disc at 180° to each other. A 
slow motion drive is fitted to each hand control. 
] Mechanical back-lash and parallax errors encountered in this measur- 
ingsystem made necessary the use of more refined photoelectric methods 
for monitoring and light balance determination. Identical thin, clear 
glass slides were introduced just beyond the projection lenses at such 
an angle that some of the main beam is scattered on to a selenium cell. 
The output from each photocell is compared in a bridge circuit and 
the difference signal passes on to a transistor D.C. amplifier. The as- 


sociated indicator requires about 300 LA for full-scale deflection and 
carries a 34-in. pointer over a dimly illuminated ground glass scale. 
Graduations are non linear, being opened out about zero. Amplification 
of about 150 enable input signals of 2 A to deflect full scale. 


1 Acknowledgement is made here for the assistance and advice of S.E.R. Mabel. 
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Procedure for Brightness Test 


In the brightness test the instructions and method were as follows: 
The subject was seated in front of the apparatus with the occluder re- 
moved. He was instructed that the circle of light before him was com- 
posed of two equal halves; one which could be varied by the experimenter 
and one by himself. Demonstrations were given. The subject was then 
told that, starting from the dark position with his control, he would be 
expected to match for brightness any value that the experimenter cared 
to set but that he must keep his eyes fixed upon the small red spot in the 
centre of the circle exactly as he had done in the previous test. General 
practice trials were then undertaken. When practice had been satis- 
factorily undertaken the subject was informed that in the experiment 
he would have to make his judgements with one eye only as the other 
would be occluded. After instruction and practice the subject was dark 
adapted to the standard level of 10-4 x 0:60 c./sq. ft. 

When dark adaptation had been accomplished the apparatus was set 
so that the subject’s control was at ‘dark’ and the standard set at 0-5 
lumens although the light beam was shuttered. The subject’s right eye was 
covered. All that he could see at this point was a small red fixation light 
immediately in front of him. The subject was told: ‘I want you to keep 
your eyes fixed on the red light. When I project a level of brightness on 
the left-hand side of the circle I want you to match it with your own 
side using the control you have practised with already. I want you to do 
it quite quickly but not so as to make errors. Tell me as soon as you 
think the two sides are equal and then close your eyes.’ 

At this juncture the shutter was withdrawn and the subject made his 
first match. When the error reading had been taken from the null 
galvanometer, the occluder was transferred to the left eye and a second 
trial undergone. These trials were the pre-satiation judgements and they 
differed only in the levels of illumination set (0-5, 1-0, 1:5, 20 lumens). 

In the post-satiation judgements the trials were essentially the same, 
differing only in the following way: ‘This time I am going to ask you to 
look at the small red spot while the left-hand side is illuminated. After 
one and a half minutes, which I shall call out every thirty seconds, I shall 
cover the light, transfer the eyeshield to your other eye, and ask you to 
make another match of equality.’ "Te 

This was done, the testing being arranged so that the post-satiation 
judgement was always made with the same eye which had made the pre- 
satiation judgement. There was a one-minute rest pause, with eyes 
closed, between pre- and two minutes between post-satiation trials. 

No descending trials were undergone because it had been suggested 
by preliminary experimentsthat self-satiation was too rapid. To simplify 
statistical manipulation two scores were derived from the four trials, 
these are numbers 58 and 59 of the correlation matrix and represent the 
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pooled error scores of the first plus the third and the second plus the 
fourth trials. 


Results 


The results of testing for both measures of figural after-effect are 
presented in Tables 4.5 and 4.6. 


TABLE 4.5 


Brightness Test 
Table showing values and differences for pre- and post-satiation measures at 
the four levels of illumination. N = 80 


Pre-satiation Post-satiation 
Trials Mean S.D. Mean S.D. Diff. t P 
1. O-Slumens —9-654 6103 —14-410 8621 —4756 4457 0-01 
2 10 , —10910 7210 —19-231 11092  —8321 7683 0-01 
a bs 35 —14:897 8720 —25-462 12449 —10-565 8-548 0-01 
4 20 5 —16667 9:347 —29:333 14857 —12-666 8300 0-01 


Readings are in micro-volts taken from deflections from zero on the null galvanometer. 


TABLE 4.6 


Comparison of Circles 
(Contoured figural after-effect) 
Table showing values and differences for the two ascending and two 
descending trials. Scores in c.m. 
x Pre-satiation Post-satiation 
Trials Mean S.D. Mean S.D. Diff. t= pe 
l. Ascending 6008 0403 5471 0-531 0537 8262 001 
2. Descending 6.647 — 0-505 — 5910 0-488 0754 12504 0.01 
3. Ascending 6221 — 0457 5-814 0491 0-407 13:567 0:01 
^. Descending 6-506 — 0474 — 6.183 0453 0323 5383 001l 


25382 1839 23378 1:963 
6345 0459 5844 0490 


These tables indicate clearly that in both cases the expected after- 


effects were obtained and are fully significant in all cases. The mean 
values presented, however, give something of a false picture, and a 
measurable figural after-effect was not obtained for all subjects on all 
trials. This is in accord with other findings concerning the universality 
of the effect. In this investigation a number of subjects failed to observe 
the effect for individual trials and four of the 80 failed to produce an 
after-effect at all. None had failed in the pilot experiment where the 
average decrease in the test figure was 3-02 mm. 

The extraversion hypothesis in relation to the magnitude of the after- 
effect has been examined in the same way as the spiral scores. That is 
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to say, the highest 23 and lowest 23 scorers along the dimension of 
extraversion, some 29% in each case, have been opposed as separate 
groups. In the case of the circles test the examination resulted in group 
means and standard deviations of the combined difference scores on 
all four trials between pre- and post-satiation scores as follows: 
Low E, mean 8:696, S.D. = 5:858; high E, mean 7-783, S.D. = 4-888. 
These scores give a mean difference between the groups of 0-913 
which is not significant. The scores themselves suggest that the 
original hypothesis that extravers should display greater magnitude of 
after-effects is not supported in this experiment. This failure may lie in 


Low E Group. 


= = - ~ High E Group. 


w 


m 


MAGNITUDE OF AFTER-EFFECT IN MM. 


4 
TRIALS asc pisc ASC DESC. 
FIG. 4.5. Comparison of Circles (Trials 1-4) Selected Groups. 


the method employed in the elicitation of the phenomenon. One such 
possible weakness lies in the prior dark adaptation allowing retinal 
effects to proceed at a faster rate than de-satiation. Another possible 
weakness may stem from the long period of inspection (satiation). 
Extraverts are low in persistence and may have found a period of four 
minutes too long to maintain attention on the fixation point despite the 
encouragement that was given. Attentional factors cannot be ruled out 
in this type of visual phenomena and this was one of the reasons why a 
test of fixation was included in the battery. Performance on the fixation 
test is correlated with both measures but insignificantly (r = 0:006 and 
—0:101). . 

Another factor which may have resulted in this predictively contrary 
result lies in the non-transfer method being used. Previous experiments 
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(Nichols, 1956) had shown that the effect did transfer, indicating that 
it is a central effect, and therefore, because of test situation difficulties, 
binocular satiation and testing was conducted and the assumption made 
that differences in the two techniques would be irrelevant. It now ap- 
pears possible that this assumption was improper, although experiments 
using both techniques still require to be done. 

Fig. 4.5 displays the relationship between «he trial means for the 
selected high and low extraversion groups (the difference between pre- 
and post-satiation trials in mm.). It will be seen that there is a large 
difference in favour of the Low E group on the earlier trials which is not 
as manifest on the third and fourth trial. Indeed the trend appears to 
be reversing itself on the fourth trial, suggesting that the persistence of 
the effect is falling off more rapidly in the Low E than the High E group. 
(It must be noted here that none of the means differ statistically. Cri- 
tical ratios = Trial 1: t = 1-68, p = 0-10, Trial 2: ¢ = 1:33, pns; 
Trial 3: t = 0:499, p = n.s.; Trial 4: t = 0-068, p = n.s.) 

The present results are equivocal in relation to Eysenck's statement 
concerning the magnitude of figural after-effects and extraversion. With 
a sufficiently large number of trials the observed tendency might have 
reversed itself to the extent that a mean assessment would be a positive 
correlate to extraversion. The obtained results, however, suggest that in 
the early stages, i.e. immediately post-satiation, it is the introvert who 
has the greater displacement magnitude. 

As with other variables, a selected group comparison was attempted 
for the combined brightness difference Scores. Initially the group means 
obtained seemed incompatible with the positive correlations found be- 
tween the two threshold measures and extraversion as they indicated 
that the introverted end of the total group had larger after-effects than 
the extraverted. However, an examination of the scatter diagram of the 
combined scores against extraversion indicated lack of homoscedas- 
ticity and, therefore, no comparison is given. 

„The correlational results of these tests are generally equivocal to pre- 
diction. There are four measures in all, representing the ascending and 
descending thresholds on the comparison of circles test, and two thres- 
holds on the brightness test. Of the first test, intercorrelations with the 
criterion measures of extraversion are generally in the wrong direction 
from prediction and one of them is almost significantly so, i.e. Circles 
Ascending and E, i.e. —0-186. In the brightness test, however, correla- 


ction, but only one (between the second 
threshold and MPI (E)) is significant, i.e, r = 0:314. The first brightness 


be measured directly as there was 
no retest. However, when assessed by taking the average correlation 


between trials and correcting for attenuation they are found to be 
adequate, lying between 0-7 and 0-8. 
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There are several correlations between these and other measures of 
specific interest. Two of these have already been mentioned in the sec- 
tion concerned with the spiral scores where it was suggested that figural 
after-effect magnitude may be a negative correlate of extraversion. Also 
of interest are the significant and positive correlations between both 
ascending and descending thresholds on the comparison of circles test 
and dark adaptation (re= 0-259 and 0-242). As both correlations are 
significant, it is unlikely that this is a chance effect and that ‘time in the 
dark’, which is common to the two situations, may influence both 
measures: although this is clear in one case it is not so clear in the other. 


Auditory Flutter Fusion 


Although an immense amount of research has been undergone to 
demonstrate the determinants of the visual response to intermittent 
stimulation and the fusion threshold (Landis, 1954), the investigation of 
the ear as a temporal analyser where the main variables are similar, 
Intensity MS. Ratio, etc., has been relatively neglected by psychologists. 
To some extent this neglect is understandable, for quite recently Stevens 
and Davis (1938) were writing somewhat contemptuously of the squan- 
dering of effort on ‘flicker’ techniques, claiming that despite the success 
achieved by such methods in vision, in audition it was a failure. They 
argued, correctly, that the sudden interruption of a pure tone did not 
produce a stimulus void but sidebands around the carrier frequency 
and, consequently, what had been often considered as experiments in 
auditory flutter were in fact experiments in modulation. The problem 
of sidebands has been largely removed by the use of white-noise, itself 
a band of all the audible frequencies which when interrupted produces 
sidebands of each of its component frequencies retaining the uniformity 
of its spectrum (Miller and Taylor, 1948); Miller, 1947 and 1948; 
Symmes et al., 1955). ; 

The use of an auditory flutter technique in the field of individual dif- 
ferences (particularly in the abnormal sector (Ax and Colley, 1955; 
Jones, 1956; Chapman et al., 1955) would appear to hold equal promise 
with tests of visual analysis. Although the ear is probably not as good 
an analyser as the eye (Gebhard, 1955; Mowbray, 1955), its range is con- 
siderably larger. As Symmes has recently observed, the fusion threshold 
for light for the vast majority of people occurs at frequencies below 
80 c.p.s. whereas, with a mark space ratio of 0-5, auditory flutter may be 
discriminated at frequencies in the region of 1000 c.p.s. The work of 
Gebhard et al. (1955) supports Symmes' contention in the demonstra- 
tion that the auditory discrimination range is some 450 times that of 
vision and that the periodicity ratio is in the region of 16 : 1. Finally, 
there is, of course, the advantage of a direct energy transformation 
from light to sound and thus a direct comparison in terms ofa common 
measure of sensory resolution. 
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The immediate enthusiasm for auditory flutter techniques was damp- 
ened by an early adverse report by Goodfellow (1938) who investigated 
both auditory and tactile thresholds for ‘discreteness’, demonstrating 
considerable diurnal variability (3 db. for auditory threshold and 5 db. 
for tactile) in both functions. The notion of 'discreteness' (and lack of) 
has also been examined by Ax and Colley who have drawn the interest- 
ing distinction between discreteness and flicker. Loss of discreteness, 
they claim, is a generalized experience not dependent upon the particular 
sense modality involved. These authors have found high correlations 
between measures of visual, auditory, tactile and electrical stimulation 
in ‘psychopathological’ cases but not in a group designated ‘neuro- 
pathological’, but find insufficient evidence to decide whether temporal 
acuity is a generalized or specific function of the brain. 

One of the most recent and widely accepted hypotheses advanced to 
‘explain’ the phenomenon of auditory flutter fusion has been suggested 
by Miller (1947 and 1948) and Miller and Taylor (1948). They claim 
that the cue for flutter is perceived when the ‘off? period of the sound/ 
time fraction is longer than the time required for the auditory sensation 
to decay one detectable step, i.e. one intensity JND. Adducing evidence 
from their own work (and also of Bekesy (1951) and Symmes (1955)), 
which has indicated that the auditory decay to threshold is linear, 
they establish the following equation: 


LC 
f Ap T X 1000 

where f= the repetition rate for fusion; I — the sound level of the 
signal in decibels above threshold; T — the duration in milliseconds of 
the physiological decay to threshold; C — 1:00 minus the sound/time 
fraction; and AI — the intensity JND for noise in decibels. From this 
manion e is p obvious that f is largely dependent bn 
values of T when I and C are h i i oh 
Licklider, 1951), eld and AI is relatively constant ( 

Now the reports of Symmes which give distributions of T values, 
mean = 152 ms. range (104-278 ms.) for a population of 46 cases leads 
to the reasonable inference that the ‘physiological decay’ periods are 
measures of the disinhibitory functions of the mechanisms involved 
assuming a constant inhibitory effect for a constant stimulus. Schematic- 
ally this Proposition may be shown in Fig. 4.6, where the time ON 
represents the dissipation rate of a JND (Intensity) for the average per- 
son. Within the time ON there will be failure to discriminate flutter 
whereas at any point beyond N, because more than AI has been dissi- 

! The T values assessed in the 
constant AI of 0-4 db. have a mean of 242 Msecs. and a range from 166-583 msecs. 
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pated, flutter will be observed. The times represented OI and OE are 
the theoretical positions of introverts and extraverts, differing in the 
speed at which inhibitory effects disappear. 

If the above proposition is acceptable, it leads through Eysenck’s 
theory of personality to the immediate prediction of longer decay periods 
and greater inhibitory effects in extraverts than introverts. It will be 
remembered that extraverts develop inhibitory potentials more rapidly 


INTENSITY 


than introverts and they dissipate these potentials more slowly. It can 
be deduced, therefore, that auditory flutter thresholds will be higher in 
extraverts. 


The Test (Apparatus) 


The test used was the University of Chicago Sound Discrimination 
Test, which has been designed and developed in the Section of Medical 
Psychology under the direction of W. C. Halstead. Detailed information 
of the test may be obtained from the manual but the following brief 
description might be useful. The test is administered by a tapc-recorder 
(in this case a Crofts Radio). The signal consists of interrupted ‘white’ 
noise with square wave characteristic at an intensity of 70 db. Repeti- 
tion rates vary between 10 and 150 i.p.s., in steps of 10, and the sound/ 
time fraction is 0-90. There are four random presentations of the fifteen 
frequencies by the method of constant stimulus. Instructions, examples, 
and practice items are all contained in the first part of the recording, 
and they are reported as being adequate for normal subjects of average 
intelligence. ] 
Unfortunately, in the present experiment the test could not be given 
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in a sound-proofed room, but all extraneous sounds were dampened 


by the room preparation which has already been referred to and by 
‘doughnut’ muffs covering the earphones. 


Results 
The two measures of the test of auditory flutter fusion have been taken 
representing the threshold (AFF I) and the rarge of frequencies about 
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FIG. 4.7. 


which uncertainty was expressed by the Subject (AFF II), i.e. the range 
between all right and all Wrong. Means and standard deviations were 
as ri (AFF I): 72244, S.D. = 14-596 (AFF ID): 45-769, S.D. 
= 17-980. 
Comparison of the two extreme groups on both measures gives the 
results as shown in Table 4.7. 
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The means on the first measure are in the predicted direction, i.e. 
that extraverts have a lower level of discrimination, although the 
difference between the compared groups of 5-869 c.p.s. is not significant 


TABLE 4.7 


Means and standard deviations on the two measures of auditory flutter 
fusion for the selected groups 


(AFF I) (AFF II) 
Mean S.D. Mean S.D. 
LowE 75652 16-73 46:087 17:52 
High E 69-783 12:85 45:217 20-64 


by t test (¢ = 1:305, p n.s.). The difference of 0-870 between the groups 
on the second measure is also insignificant. The results cannot be con- 
sidered as supporting, at an acceptable level, the initial hypothesis. 

The correlations of both measures, i.e. threshold and range, are all 
insignificant with the questionnaire measures ofextraversion. The failure 
of the test measures to correlate with the personality scores is empha- 
sized by the fact that they are not even in a consistent direction, i.e. 
both coefficients are insignificant but negative with the MPI (E) scale 
and both insignificant but positive with the CPF (E) scale. 

Another feature of interest is the correlation of both measures with 
the K scale of the MMPI (r = 0:306 and 0:225) and the D scale of the 
CPF (r = 0-196 and 0:254). All four correlations are positive and sig- 
nificant. As one scale (K) is said to measure ‘attitude to test items’ and 
the other (D) is essentially a lie scale, it is thought that both measures 
should be treated with suspicion. 


Summary and Conclusions 


Six perceptual tests were given to two experimental populations with 
a total N of 97. These were: dark adaptation, auditory flutter fusion, 
spiral after-effect and two measures of fig. after-effect. With the tests 
were administered several personality questionnaires and scales. 

Two predictions were made: (a) that through the formal and func- 
tional identity between ‘reactive inhibition’, as defined by Eysenck, and 
‘satiation’, as defined by Kohler, a relationship would manifest itself 
between the perceptual test results and a questionnaire measure of in- 
hibition in the form of ‘extraversion’, and (6) in so far as the tests 
were indicants of differential inhibition sufficient co-relation should 
exist between them to suggest that they measured different aspects ofa 
basically similar process. 

Neither prediction has been affirmed with any great degree of con- 
fidence. Concerning the first prediction only three of the twelve corre- 
lations between the MPI scale of ‘extraversion’ and the experimental 
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variables are significant and one of these is contrary to the hypothesis, 
as are two others of the remaining nine which do not achieve significance. 

The qualification made of the initial hypothesis must also be made of 
the second. Although a greater number of correlations are significant 
than could be expected by chance alone, and a large proportion of the 
remainder are in the predicted direction, sufficient power of correlation 
does not exist to allow the immediate inference of a common cause.! 
When comparisons are made between two groups representing the upper 
and lower 29% of the extraversion dimension, there is again general 
failure to find support for the initial hypothesis. 

The measurements taken of the spiral after-effect persistence have 
emerged as highly reliable despite their failure to distinguish signi- 
ficantly between the selected groups, and the correlational evidence is 
such as to support the assertion that it is a reasonable measure of extra- 
version as assessed by the MPI and the C.P.F. 

The phenomenon of displacement or sensory reduction following 
continuous stimulation has been obtained at a satisfactory level of con- 
fidence, i.e. despite non-universality, in both measures of figural after- 
effect. The extraversion/satiation hypothesis, however, is not supported 
by the comparison of extreme groups or correlation. It has been con- 
cluded that the prediction of larger figural after-effects in extraverts is 
probably incorrect, particularly in the early stages. 

The general failure of the two major tests to correlate with, or dif- 
ferentiate between groups based on extraversion is repeated for auditory 
flutter fusion, fixation, and dark adaptation. 

The results of the present investigation are not critical to any theory 
and support only the complexity of ‘inhibition’ as an explanatory con- 
cept in psychology. Several weaknesses, both methodological and theo- 
retical are obvious and the more prominent may be stated as follows: 


(a) The measures taken are not related to inhibition or satiation in 
any straightforward manner; 


(6) That ‘inhibition’ is only a secondary characteristic in performance 
(as measured) instead of th 


e primary which has been postulated; 
(c) That the tests chosen and administered are not appropriate to the 


demonstration of measures which can be designated clearly as differen- 
tial inhibition indicants; 


(d) That too little is 
behaviour elicited, i.e. at 
has called ‘the unalysed 
variables ma 
process; 


known of the motivational aspects of the 
tention, interest, instruction, and what Dodge 
mass of obligations', and the way in which these 
y Suppress or change the theoretical course of the inhibitory 


1 It is, of course, almost impossible to make any assertion as to common causation 
by a simple visual or item by item examination of such a large matrix, and factoriza- 


tion is more appropriate. The factor analysis of these and other test results will 
found in a later section. 
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(e) That the functional identity of reactive inhibition and satiation 
requires re-examination before the promising line of research which it 
offers is obscured by premature inferences which fail to allow for differ- 
ence in ‘fatigue’ when generated by neural reaction and skeletal re- 
sponse. For instance, the motivational components, which have been 
already referred to, are usually clear in learning experiments and at- 
tempts have been made to lay down the relationships between ‘drive’ 
and Ip (fatigue). No such service has been attempted for satiation pheno- 
mena although it is known that abnormal groups, which may differ 
from normals motivationally, also differ in satiation effects; 

(f) That the personality dimension of extraversion being derived 
from a questionnaire which is basically behavioural in emphasis fails to 
account for all aspects of differential fatigue and discrimination in per- 
ceptual test measures. 

Once these weaknesses have been removed and the functions of 
satiation clarified, the hypotheses which have been examined in this 
Investigation will, perhaps, emerge in a more clearly testable form. 
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THE design of the experiment, details of which have been reported in 
the two previous Sections, was geared to a factorial analysis of the 
intercorrelations between the tests, and we must now turn to such an 
analysis. Fifty-nine scores in all were intercorrelated but only thirty- 
four of these were selected for a factor analysis. The reasons for the 
exclusion of the other scores lay mainly in the fact that they were 
alternative forms of the same test, or else scores of part of a test where 
only the total score was used in the factor analysis, and other reasons 
of a similar nature. The actual thirty-four test scores used are given in 
Table 5.1. They are not described in detail because the reader will find 
the detailed descriptions in the previous two sections; enough i$ 
said under the heading ‘Nature of Test’ to give a fair indication 


of the nature of each score without having to turn back to a previous 
page. 
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that the loadings on this factor are relatively high but have little if any 
psychological interest. It will not be discussed further in this chapter. 

Factors 1 and 2, in accordance with the hypothesis underlying the 
selection of tests, should be extraversion and neuroticism. The reader 
may find it advantageous to consult, in trying to answer the question 
of whether the results bear out the hypothesis, Fig. 4.1 (p. 195), which 
presents the factor loadings on Factors 1 and 2 in diagrammatic form. 
Lines have been drawn at values of +-0-25 and —0-25 as a rough visual 
check on the adequacy of the simple structure solution; it will be seen 
that hardly any tests lie outside the centre cross, and those that do are 
only very little removed from the inside. . 

We must first of all enquire regarding the position of the various 
questionnaires included, as previous experience with them gives us a 
fairly clear picture of the kind of solution expected in terms of our 
hypothesis. It will be seen that Factor 1 is characterized by the MPI 
extraversion score (—0:540), the Cattell extraversion score (—0:452), 
the MMPI psychopathic deviate scale (—0:458), and the MMPI 
hysteria scale (—0-307). These are the four scales expected to be 
measures of extraversion, and all of them are found to have medium to 
high loadings on Factor 1. We may conclude that in terms of question- 
naire measurements Factor 1 may be identified as one of extraversion. 
(It will be noted that all the loadings have a negative sign, so that it is 
the negative end of the factor which may be regarded as the extraverted 
end, while the positive end is to be regarded as the introverted end.) 

Itis interesting to note that both the Cattell and the MPI extraversion 
scales have nearly identical loadings on this factor, because both were 
developed quite independently, and indeed until quite recently Cattell 
was rather critical of the very notion of extraversion as a factor. The 
MMPI scales, which were developed independently of both the Cattell 
and the MPI scales also have comparable loadings, as indeed might 
have been expected on the hypothesis that both hysteria and psycho- 
pathy are extraverted neurotic disorders. The result would seem to pro- 
vide some evidence of validity for this hypothesis, and also for the 
validity of the MMPI scales. It will also be noted that in all the analyses 
reported to date (Hildebrand, Claridge, Willett-Holland) the extra- 
version questionnaire used had a loading of about 0-55 (0:563; 0:48, 
0:64; and 0-540 respectively). This constancy 1n spite of differing samples 
of subjects and tests is encouraging. The actual values are rather low, 
however; they suggest that only something like 3076 of the total variance 
of the factor is being measured by this scale. This puts severe limits to 
its use as a criterion; to obtain an estimate of the correlation of any 
Biven test with the factor of extraversion, when only its correlation with 
the questionnaire (rg) is available, the most likely true correlation 7, 
would be: E 

n = van 
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The usual run of r, correlation is around 0:3, which would give an 
r; correlation of above 0:5. These values, in turn, would of course 
require to be corrected for attenuation in the experimental test which 
was being correlated with the questionnaire, particularly when the 
reliability of such a test happened to be low. Some further discussion 
of this point is given in the Epilogue. 

The second factor has its highest loadings on the MMPI psychasthenia 
scale (—0-633), the MPI neuroticism scale (—0-582), the MMPI K-scale 
(0-753), and the Cattell lie scale (0:687). These four scales define the 
factor very accurately, all of them lying almost exactly on one line, and 
strongly support the hypothesis of Factor 2 as a neuroticism factor, at 
least as far as its verbal components are concerned. We may note with 
some surprise that the hysteria scale has a positive loading on this 
factor, i.e. is situated towards the normal rather than the neurotic end. 
It is difficult to know whether this loading is significant, and if so 
whether it would be duplicated in research using a less extraverted 
sample of subjects. It should also be noted that the MPI extraversion 
and neuroticism scales are located in positions such that the angle of the 
vectors joining them to the origin is almost exactly 90°. This agrees well 
with the usually observed independence of extraversion and neuroticism 
measures in normal samples. There is here a considerable difference 
from the results of the Claridge study, and we will return to this later. 

Having decided provisionally that Factors 1 and 2 may be identified 
with extraversion and neuroticism tespectively, we must now turn to 
consideration of the various scores derived from objective tests, in order 
to see to what extent these conform with prediction. The first test to 
attract attention by virtue of its high saturation is the length of after- 
effect produced by a rotating spiral. Four scores are included in the 
analysis of which two are direct measures of the length of after-effect, 
namely No. 32 and No. 4. These are expected to correlate positively 
with introversion and should therefore have positive loadings on Factor 
1; the loadings actually found are 0:726 and 0-685. It may be concluded 
that the prediction is verified. As regards the two other scores derived 
from the spiral, namely the ‘slope’ and ‘distance’ scores, it is difficult 
to interpret these or derive any special predictions for them. It might be 
argued that in terms of the theory outlined in H. Holland’s section, a 
high distance score’ should be found in extraverts by virtue of the fact 
that the inhibition which is supposed to underlie the curtailment of after- 
effects would be more effective with extraverts at longer periods of 
rotation, thus producing a negatively accelerated curve, and the fact 
that the score in question has a negative loading (—0:222) would seem 
to support such an interpretation. However, the loading is too low and 
the prediction too uncertain to inspire much confidence. 

The next test to be considered is the spatial conditioning test. Here, 
according to the hypothesis, we would expect scores 15 and 16 to 
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correlate positively with introversion, and indeed both scores have posi- 
tive loadings on Factor 1 (0-256 and 0-446). The prediction is therefore 
verified, but it should be remembered that the status of the spatial 
conditioning test as a measure of conditioning is doubtful for the 
reasons indicated by R. Willett in his section. Clearly further work is 
required on this test before we can confidently regard it as a true measure 
of spatial conditioning, and until such investigations have been carried 
out it is impossible to come to any firm conclusion about the apparent 
verification of our prediction. 

We next turn to the two measures of figural after-effect included in 
this study. The prediction linking length of after-effect with extraversion 
is clearly not verified. The test using contours (circles) has quite high 
loadings on introversion (0-460 and 0-551 respectively for descending 
and ascending trials). The scores of the test involving brightness are in 
the predicted directions, but the loadings are too low to be of any great 
interest (—0-068 and — 0-096). 

Before regarding these results as a disproof of the hypothesis, we 
may consider the argument put forward in the epilogue, according to 
which we have two possible effects produced by the inhibitory potential 
underlying extraverted behaviour. In the first place we have the fact 
that inhibition should mediate strong figural after-effects, a prediction 
clearly not verified in this study. In the second place, however, we have 
the prediction that inhibition should produce difficulties in maintaining 
fixation over long periods, an effect which in view of the importance of 
fixation for the occurrence of figural after-effects would lead to a lesser 
degree of figural after-effect in extraverts. We thus have two opposing 
hypotheses, and it seems necessary to introduce as an important experi- 
mental parameter the actual length of stimulation. Figural after-effects 
can be produced by quite short periods of stimulation, but it is unlikely 
that difficulties in maintaining fixation would arise until some consider- 
able time had elapsed. It might therefore be hypothesized that with short 
periods of inspection satiation would be generated more strongly in 
extraverts, as predicted, and the process would not be disrupted by 
difficulties in maintaining fixation. This might be true with periods up 
to one minute, say. After that point we might expect difficulties in 
fixation to arise and more or less cancel out the strong satiation effects 
produced by extraversion. With periods of three or four minutes we 
might expect difficulties in fixation to more than outbalance the stronger 
satiation developed by extraverts, and actually lead to an inversion of 
the prediction, so that with very long periods we may find introverts 
Showing greater figural after-effects. : 

The facts seem to support some such interpretation. Rechtschaffen 
(1958), using a forty-second period of fixation, obtained results showing 
Stronger satiation for extraverts at a level not far removed from signi- 
ficance on the appropriate one-tail test. Nichols (1956), using periods 
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of two minutes of fixation, and Holland in the brightness figural after- 
effects study included in this factor analysis, using ninety seconds of 
fixation, both obtained results showing no marked difference either way 
between extraverts and introverts. Holland, in the other figural after- 
effects study included in this factor analysis, used a four-minute period 
of inspection and found a quite marked relationship between intro- 
version and figural after-effects. While it is true that these writers all 
used different tests and different types of subjects in their various 
studies, it nevertheless remains true that the results do fall into a clear- 
cut progression such that short periods of inspection appear to indicate 
greater figural after-effects in extraverts, while long inspection periods 
appear to show less figural after-effects in extraverts, with intermediate 
periods of inspection giving neutral results. This would appear to be a 
hypothesis well worth testing directly in future research. 

It will be noted that test No. 29, which is labelled a test of failure to 
maintain fixation, is loaded on neuroticism (—0-400) rather than on 
extraversion (—0-026). This would appear to contradict our hypothesis, 
but it should be pointed out that the interpretation of this measure as 
one of failure to maintain fixation is by no means well-established, and 
Tequires a direct and formal proof before being acceptable. Even if it 
should turn out that this measure was indeed what it purports to be, it 
Would still be possible to modify the hypothesis outlined above by 
postulating some other inhibitory effect increasing in time and counter- 
acting satiation. There are several candidates for such a position. With 
maintained inspection the inspection figure tends to become less vivid 
and may occasionally disappear completely; this type of satiation 
effect, which according to the hypothesis would be stronger in extra- 
verts, might serve to reduce the impressiveness of the inspection figure 
and thus reduce the strength of the after-effects produced by it. (Cf. the 
first part of the hypothesis relating to spiral after-effects, where such in 
effect is explicitly postulated.) Other possibilities of a similar kind wil 
no doubt occur to the reader, and we must conclude that the data 


not by any means make a final 
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ment would seem to lead to the prediction that dark adaptation measures 
of the particular kind here used would show a poorer dark adaptation 
in extraverts, as is indeed found. The failure of neuroticism to correlate 
positively with poor dark adaptation is less easy to understand as in the 
past results have usually tended in that direction (Granger, 1953; 1954, 
1955, 1957). However, it is well known that dark adaptation is a highly 
complex phenomenon and correlations can be expected to vary accord- 
ing to the precise type of apparatus used and the precise method of 
measurement chosen. 

Both verbal conditioning (0-071) and eye-blink conditioning (0:116 
and 0-177) were expected to correlate with introversion, and while the 
factor loadings are in the right direction, they are clearly too small to 
carry conviction. While the prediction regarding the verbal conditioning 
Score was predicated on the hypothesis that this was indeed a measure 
of conditioning, a hypothesis for which there is no strong evidence, the 
same excuse cannot be made for eye-blink conditioning, which had on 
previous occasions correlated quite highly with extraversion. The only 
possible explanation of its failure to produce a high factor loading on 
introversion can be an appeal to sampling errors. The argument has 
already been presented by R. Willett and will not be repeated here. It 
would obviously be desirable for the experiment to be repeated in order 
to see whether this possibility can indeed be admitted as a likely ex- 
planation. There is a slight tendency, as shown by score 19, for verbal 
conditioning scores to be more variable among extraverts (—0:165), and 
this would seem to be in line with the general evidence of greater 
change and performance by extraverts; however the loading is too low 
to allow of any particular interpretation. . 

We must now turn to the tests correlated with neuroticism. Mention 
has already been made of test 29, or failure to maintain fixation; 
(—0:400); until the nature of this test is made a little more explicit, 
however, no reasonable interpretation of it can be given at the moment. 

Much the same is true of tests 9 and 10, both of which are measures 
of salivation (—0-468 and —0-539). This test had originally been 
intended as a measure of conditioning, and as such it would have been 
expected to correlate with introversion rather than with neuroticism. 
However, as shown by R. Willett, supplementary experiments have 
thrown considerable doubt on this interpretation of the test, and it 
seems likely that it must be regarded largely as the effect on salivation 
of a simple light stimulus. Why such a stimulus should produce effects 
of this kind is not obvious at the moment, and until more work has been 
done on salivatory conditioning, so-called, it is impossible to interpret 
the result. . 

Lastly we come to tests 2 and 3, which are measures of auditory 
flutter fusion (0:367 and 0:309). It would appear that neurotics have 
a lower flutter fusion threshold, and it would also appear that there is 
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no relationship between these thresholds and extraversion. The results 
seem to be in good agreement with the work of Krugman (1947) and 
Goldstone (1955) who, working with visual flicker, both found that the 
more neurotic groups tended to have lower thresholds while the more 
normal groups tended to have higher thresholds. The position is rather 
complicated by the fact that both Krugman and Goldstone used dysthy- 
mics as representatives of their ‘high neuroticism’ groups, thus leaving 
it open whether the low CFF thresholds were due to the neuroticism or 
the introversion of the subjects. In the Dynamics of Anxiety and 
Hysteria Y put forward a theory linking CFF to the extraversion- 
introversion continuum, predicting ‘in extraverts and hysterics lower 
thresholds of temporal discrimination’. Evidence on this point is not 
conclusive, but if it were true, as the present study suggests, that neurot- 
ics are characterized by low thresholds of temporal discrimination, 
then the hypothesis linking flicker and fusion thresholds to extraversion 
would have to be reconsidered and might in fact have to be inverted. 

This concludes our discussion of the results of this factorial study. It 

may be of some interest to compare some of the results with those 
reported by G. Claridge. It is by no means obvious that results, even of 
similar tests, should maintain similar relationships in two groups differ- 
ing as much as those here considered. As an example of the kind of 
difference to be expected, let us consider the position of the two MPI 
scales. It will be remembered that extraversion and neuroticism are 
quite unrelated in the Willett-Holland study, extraversion having a 
loading of —0-049 on the neuroticism factor, and neuroticism having a 
loading of 0-078 on the introversion factor. When we come to the 
Claridge study we see that this is by no means true. Here extraversion 
has a loading of —0-53 on the neuroticism factor, while neuroticism has 
one of 0-09 on the extraversion factor. In other words, among neurotics 
extraversion has a strong negative loading on neuroticism, while among 
normals it has quite a negligible loading on neuroticism. This fact had 
been observed several times before and indeed has been remarked upon 
in the Manual of the MPI (Eysenck, 1959b). Clearly there are interaction 
patterns at high neuroticism and high introversion scores which do not 
permit us to retain the hypothesis of linear regression. It is not im- 
possible that other similar effects will be found in connection with other 
tests, and it would seem that an investigation on a comparative basis 
of intercorrelations of different test scores in a normal and in a neurotic 
group would be of considerable interest and importance. 

Another test which clearly differs in its factorial orientation in the 
two studies is the spiral after-effect. In the Willett-Holland study spiral 
after-effect is highly correlated with introversion and very little with 
neuroticism; in the Claridge study it is highly correlated with neurotic- 
ism and only slightly with introversion. In both studies, therefore, it is 
the dysthymic who comes out as having the strongest after-effects, but 
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the main stress in the one case is on neuroticism, on the other intro- 
version. Why this should be so, is by no means clear. 

Auditory flutter fusion also does not give identical results in the two 
studies, In the Willett-Holland study it will be remembered that auditory 
flutter fusion had a negative loading on neuroticism; this is true also in 
the Claridge study, but only to the quite insignificant extent of a loading 
of —0-07. Auditory flutter fusion, however, has a loading of 0:39 with 
extraversion in the Claridge study, while it has no such loading in the 
Willett-Holland study. ' 

The only remaining tests which occur in both batteries are the MMPI 
scales, and these again provide considerable contrast. All the scales used 
have very high loadings on neuroticism in the Claridge study and no 
loadings at all on extraversion; it will be remembered that in the 
Willett-Holland study both the hysteria and the psychopathic deviate 
scale had high loadings on extraversion but not on neuroticism. These 
differences, of course, are understandable in terms of the much greater 
variance on neuroticism in the group studied by Claridge as compared 
with that studied by Willett and Holland; if corrections were made for 
this difference in variance, then rather more similar results might be 
obtained. os 

It seemed worth while to draw attention to these comparisons in spite 
of the small number of tests which was common to the two batteries, 
because it seems to be likely on methodological grounds that in future 
work much useful information could be gathered if identical batteries 
of tests were administered to groups of subjects widely differing in 
respect of their positions on dimensions under investigation, while as 
similar as possible with respect to irrelevant dimensions (age, sex, social 
background, intelligence, etc).! It is easy to predict, and even to control 
statistically, the simple effects of differences in means and variances 
along the dimensions involved, i.e. neuroticism and extraversion in this 
case; what is of much greater interest, and indeed constitutes an almost 
virginal field of exploration, is the problem of non-linear regressions, 
interaction of factors at extremes, and so forth. Such findings as those 


1 In our experience, it has usually been easier to verify hypotheses when dealing 
with neurotic than with normal subjects. Similarly, it has usually been easier to verify 
hypotheses with older than with adolescent (or even younger) subjects. The latter 
finding may be due to the relatively unstructured personality of young subjects (cf. 
Eysenck, 1960). The former finding may be due to the multiplicative action of D 
(anxiety), which, being stronger in neurotics, produces greater differences between 
extraverts and introverts where these differ in any characteristic which is multiplied 
by D to produce sEr. Both explanations are speculative, but it should not be im- 
possible to test them experimentally, or to investigate the correctness or otherwise 
of the impression mentioned above concerning different groups. The greater success 
of the Claridge experiment, as compared with the Willett-Holland study, may be due 
to the fact that in point of time his experiment was planned after the other one nan 
been completed, thus enabling him to benefit from the experience of Holland an 
Willett. 
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relating to the interaction between introversion and neuroticism demand 
a theoretical explanation and are likely in turn to lead to interesting 


and important experiments which might throw light even on problems 
other than those directly involved. 


SUMMARY 


In this chapter the factor-analytic study has been reported of a variety 
of verbal and performance tests of personality. These were selected in 
the hope of providing measures for the two factors of extraversion and 
neuroticism, and the factor analysis has clearly demonstrated that this 
hope was justified. A large number of individual predictions was made 
and specific hypotheses tested; some of these were verified, some failed 
to be verified; others had to be modified, and in others certain para- 
meters had to be added to the statement of the hypothesis to incorporate 
the results. A comparison was made of the results of the analysis with 
those of another study carried out on a group of neurotic subjects of 


similar age, sex, social class and intelligence, and certain interesting 
and instructive differences observed. 
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FACTOR ANALYSIS AND SOME 
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P. Slater 


1. THE GEOMETRICAL MODEL USED IN PRINCIPAL COMPONENT ANALYSIS 


THE methods of analysis to be discussed all apply to data which can be 
set out in a table with a number of columns, say 7 and a number of 
rows, say N. Each column refers to a psychological test and each row 
to a person: the entry or element in the Nth row and mth column is the 
Nth person's score on the mth test. Any such N x m table can be 
represented geometrically by a dispersion or scatter of N points in an 
m-dimensional space. To examine this geometrical model let us begin 
with one person's score on two tests. 

As a 2-dimensional space we may use any plane surface, e.g. the sur- 
face of a piece of paper. Any position on such a surface can be specified 
by a pair of measurements if two intersecting lines or axes are laid down 
and its distance from each of them is measured; and conversely any 
pair of measurements specifies one position. When the axes are used to 
represent the two tests the pair of scores the individual has obtained 
locates a position for him on the plane. Thus in Fig. 6.1 the position P 
is referred to the axes OX and OY by the measurements 4-14, +20. 
They indicate that a movable point will arrive at P from O by travelling 
1-4 units of distance in the direction of X and then 2:0 units in the 
direction of Y, i.e. parallel to the axis OY. In imagining them as carry- 
ing the point from O to P we are treating the pair of measurements as 
a vector. Another vector, —1:8, +3-0, would carry the point from O 
to a different position, Q (the negative sign serves to indicate that 
movement parallel to OX is in the direction away from X). . 

Any such vector, i.c. the co-ordinates giving the position of a point 
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anywhere in the 2-space, can be used to define an axis. For instance 
passing a line through O and Q and continuing it indefinitely in either 
direction defines the axis OQ. There is no limit to the number of axes 
that can be laid out on the same surface with the same origin. Any two 
vectors, provided they do not coincide, will serve as reference axes for 
locating a point on it. But a particular pair may be particularly con- 
venient for a particular purpose. Thus in Fig. 6.1, OQ might be con- 
sidered preferable to OY for use in conjunction with OX, because OQ 


FIG. 6.1. 


and OX are at right ang 
onto OX, OQ does not. 


Bas we do not require all the possible positions a point can occupy i” 

e "Space to specify all the possible dimensions or axes. The points 
lying on the circumference of a circle with its centre at O are sufficient. 
It is customary to use a circle with unit radius for this purpose. When 
the reference axes OX and OY are set at right angles to one another aS 
in Fig. 62, the co-ordinates of any such point satisfy the condition 


x? z= 1; i : : 
Ma ; they constitute what is generally called a normalize 


les to one another. In other words, OY projects 


This model can be extended to represent the results obtained when N 
people have been given two tests, X and Y, and their scores have bee? . 
standardized (means equated to 0 and s.d.s to 1). The reference axes 
OX and OY are set at right angles to one another oma plane, cf. Fig. 6 
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and the pair of scores obtained by each individual is used to locate his 
position. Evidently a scatter of N points will be obtained; and as the 
tests usually exhibit some degree of correlation it tends to be elliptical, 
as in Fig. 6.2, where it is shown by acontour line and a correlation of 0:5 
is illustrated. 

This ellipse has two canonical axes, the major one OP and the minor 
OQ, defined respectivelycby the vectors +1, +1 and +1, —1, which are 
its latent vectors and may be denoted l’, and l'a. A factor of 1/V2 can 
be introduced to normalize them. The first defines the dimension in 
which variation is greatest and the second the one in which it is least. 


Y 


FIG. 6,2. 


the ellipse is 


The equation for the ellipse takes its simplest form when ipse i 
canonical is 


referred to its canonical axes, and this is why the term 
applied to them. The equation then becomes 

p q* 

fF --£-1 

1 2 

Here 4,, is the square of the distance from the centre of the ellipse to its 
circumference along the major axis and A, is the square of the distance 
along the minor axis; the /’s are known as the latent roots. (Note that 
when A, = A, the ellipse is a circle.) The position of a point given as 
Xo, y; With reference to the X and Y axes can be specified with reference 
to the P and Q axes by using the /'s as weights ina weighted summation. 


Thus p= (xi + y)/V2 and qi F — y)/ V2. 
249 


P. Slater 


When the scores on the two tests are left unstandardized their 
scatter still remains elliptical but the centre of the ellipse may be at any 
distance from the intersection of the two reference axes, and the canon- 
ical axes may intersect the reference axes at any angle. eng eon, px 
equation of the ellipse is much more cumbersome and in genera a 
properties are much less easily described. However, there may : 
some conditions under which standardization should be avoided. The 
assumption that the N people differ from each other to just the ei 
extent in one psychological characteristic measured by x as in anot e 
measured by Y cannot be exactly correct and in many circumstances 1 
may not even be plausible, cf. Thomson (1950). It is excusable only as 
a first approximation when accurate information is lacking. — 1 

When the scores are standardized but left referred to their origina 
axes (X and Y in Fig. 6.2) the equation takes the form 

x?—2rxy--y?—k . 
where r is the correlation between the scores in X and Y and k is an 
arbitrary constant. Taking different values of k will give a set of con- 
centric ellipses with the same ratio 2, : 2,. Thus the ellipse encompassing 
the area within 1 s.d. of the bivariate mean point is specified by 
k — X1 — r2. 

This equation appears quite tractable, for as there are only two i: 
only one Cross-product term appears in it. But when the number 0 
Psychological tests used in the experiment increases to m, and the 
geometrical model expands from a 2-space to an m-space, the number 
of cross-product terms in the equation for the ellipse, standardized bu 
left referred to its original axes, increases to m (m — 1)/2; and the 
mathematical convenience of using the canonical form becomes SO 
much the greater. To describe how it can be put in this form I shall use 


matrix notation, and must apologize for introducing it without any 
adequate explanation for lack of space. 
Readers unfamiliar with tl 


which it occurs, the second the colum 
rows are re-written as columns; it is 
it is symmetrical about its leading d 


R=R’ 
meaning that R is identical with its transpose. For the typical element 
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in r, namely rj, the correlation between test i and test j, is identical with 
ry—in other words, it is entered in row i column j and again in row j 
column i. 

The m variables may be denoted by a column of x’s with tranpose 


x’ = (Xy, Xp... Xm) and the correlations between them may be written 
as 
n Tyo Fis >- Tim 
Toi 1 Fos >» Tom 
R= |r  /» 1 -> Tom 
Tm m2 Fm3 1 


ie. as a symmetrical square matrix with units in its diagonal. The 
equation for an ellipse describing the multivariate frequency distribution 
can then be given as 

xR x =k 
where k is an arbitrary scalar quantity as before. Differentiating we 
find that the particular values of x, sc. x = /, which define maxima and 
minima are specified by the ‘characteristic equation’ 


Rx = Ax 


where 4 enters as a Lagrange multiplier. This equation can be understood 
as implying that at points such as P and Q in Fig. 6.2 where the distance 
from the circumference of the ellipsoid to its centre 1s locally maximum 
or minimum the line perpendicular to the tangent (or the tangential 
hyperplane in the m-space) is continuous with the line from the circum- 
ference to the centre, so both are defined by the same vector. Looking 
at the Figure it is easily seen that this is a property which must character- 
ize the maxima and minima, and that other points on the ellipse do not 
Possess it. But the points on the circumference of a circle or hypersphere 
all exhibit it, for all are equidistant from the centre. j 

The best known method for obtaining the numerical solutions of the 
characteristic equation for a given R is the one due to Hotelling, which 
is fully explained by Thomson (1950), Holzinger and Harman (1941), 
and others. But very powerful methods have been developed more 
recently for use with electronic computers: they should certainly be 
employed where the facilities are accessible. The resulting 4’s and their 
associated vectors, the /’s, are often known as the principal components 
of the matrix of correlations. . 

The measurements on a 4 scale of an individual whose standardized 
Scores on the m tests are specified by the row vector X';, can be obtained 
by computing 1'x,. For this purpose the elements of / must be normal- 
ized, i.e. scaled down to satisfy the condition /'7 = T. 

The first latent vector, /,, defines the dimension in the m-space where 
variation is greatest, and its associated root, Jy, defines the extent of 
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variation in that dimension. The successive latent vectors l =h, la 
- ++ lm define dimensions at right angles to one another in the € 
reference space and where variation is uncorrelated and progressively 
smaller in extent. These properties can be summarized conveniently 


when the subscripts j and k are introduced to specify two non-zero roots 
and their associated vectors. 


1. The latent roots are progressively smaller, viz.: 


Ay dnega She 
2. The vectors are at right angles, viz.: 


lj, — 0 
3. They are uncorrelated, viz.: 


RI = 0 


2. WEIGHTS USED IN PARTIAL REGRESSION EQUATIONS AND AS 
DISCRIMINANT FUNCTIONS 


In many psychological experiments the m tests are used as a battery in 
the hope that some linear combination, i.e. weighted summation of uc 
will provide a reliable estimate of a criterion. For instance a battery © 
tests might be tried out in developing a procedure for selecting salesmen. 
Itcould be administered to a relatively unselected sample of persons yho 
afterwards pass through a standard training or probationary period. 
careful assessment of their success is made; it constitutes the criterion, 


Y. Correlations between the tests and the criterion, say 


y = (ry Fays». 'my) 
could be computed and the multiple reg 


Rx=y 


Solved to ob Particular set of values of x, usually denoted x =p 
and described as the partial regression coefficients, by which Y can | 


ression equation 
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differences between the means of the two groups found as 
d= Gi, ds... dia) 

The analogous equation 

Rx=d 
is solved to obtain another particular set of values of x, say x = y, 
generally known as discriminant functions. When substituted in the 
equation 

d; — y'xi 
they will give a discriminant score for an individual from which he can 
be allocated to one group or the other with a minimum risk of error. 


3. INTERNAL FACTOR ANALYSIS 


All such f's and y’s are vectors defining dimensions in the m-space 
Which may be of psychological or other interest in particular contexts. 
Unless it relates to some external criterion of specific interest there 
might not seem to be any reasons other than those of mathematical 
expediency for preferring one vector to another; all might appear 
equally worthy of consideration. The fact is, however, that psycho- 
logists are often predisposed in favour of particular vectors and can 
give reasons for believing that some define primary, unitary or funda- 
mental psychological traits while others do not. 

To avoid the use of affectively toned words, let us suppose that F1 
and F2 are two such traits which cannot be assessed directly. They are 
considered to be independent of one another. Individual differences in 
F1 are believed to affect scores in psychological tests of one kind, the 
reference tests for F1; differences in F2 affect scores on other tests, the 
reference tests for F2; and scores in some further tests may be affected 
by both F1 and F2 to differing, unknown extents. The m tests are chosen 
to include some reference tests of both kinds as well as the tests sub 
Judice. Tests suspected of assessing other characteristics in common are 
avoided. . 

If the results are in accordance with expectation, substantial correla- 
lations will be found among the F1 tests and among the F2 tests. Neither 
kind should correlate with the other, but both should have some correla- 
tions with the test sub judice. Two of the latent roots of R will be 
relatively large, the remainder will all be small and may not vary widely. 
The vectors associated with the first two roots will define a sub-space of 
two dimensions in the m-space which is called, when generalized, the 
common factor space, or f-space for short, the residual sub-space being 


the specific factor space or s-space. 
But they are not the only vectors which will serve to define the f-space. 
Any other pair of vectors lying in the same plane will serve equally well, 
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cf. Fig. 6.1 and the accompanying discussion. Techniques for rotating 
vectors are well known (Fruchter, 1954, for example) and they can be 
extended to any number of dimensions. So if there are valid psycho- 
logical grounds for preferring a particular pair there are no mathemati- 
cal objections. And again if equally cogent psychological arguments can 
be adduced in favour of another pair the mathematician will be well 
advised not to attempt to arbitrate. In spite of such hazards a consensus 
of opinion tends to grow up in course of time about the tests which 
should be used as reference tests for certain factors, and as it comes into 
existence the allocation of tests still sub judice tends to become less 
controversial. 

The unrotated factor loadings are not always obtained as the /'s in 
a principal component analysis. Other methods are employed. Con- 
sequently the f-space may not always enclose the maximum possible 
part of the m-dimensional dispersion, and the s-space will then retain 
rather more than it need. It will be still further inflated if one or two 
factors are discarded after rotation. But the outcome of these incomplete 
analyses is always to separate a residual sub-space with relatively small 
canonical axes from the common factor space, where the canonical axes 
is treated as containing the variation of 
space as psychologically negligible. 
hological reputation beyond the context 
is defined within that context by a set of 
the f-space enclosed within the m-space. 
must be regarded as a linear dependent 
m of the scores on the original battery of 
with any other vector or weighted sum of 
r independent battery of tests; the most that 
ree of correlation between the two. The same 
may however be extended to the second 


into an f-space containing axes alon 


€ have seen, by their ff-vectors and 
1-space of the original tests, 
tion to the factor space. Are 


9 find them contained in the 
254 


Factor Analysis and some Allied Procedures 


f'space? Or is this where they are usually found experimentally? There 
will be opportunity for considering the theoretical question in Chapter 7, 
where the general principles of canonical analysis of discriminance are 
described (cf. especially section 2) and an experimental example will 
be discussed in Chapter 8 (cf. section 5). 

Since the factor space is contained in the test space, factor scores can 
have no advantages over test scores for purposes of estimation. Some 
accuracy must be lost in substituting them for the test scores, even if it is 
not always great. Positive advantages can only be claimed for some 
other purpose, e.g. showing the relevance of the data to a psychological 
theory. 
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1. THE VARIANCE-COVARIANCE MATRICES WITHIN GROUPS AND 
BETWEEN GROUPS 
IN Chapter 6 the N cases under 


measure N individuals forming representa- 
tive samples of g groups, A, B... G. Any data originating from one 
of the groups can be indicated by one of the subscripts a, b... g. 
Thus 

ng = the number of cases in group G 

Xx, = the sum of the Observed measurements in these cases 
=x,” = the sum of the Squares of the measurements 


(Note that x is now used for à ràw, unstandardized Score.) 
Let 


Na + ny + oa Ny 
Exa + Ex, +... Ex, 
Xx + Xx iss Xx 
The totals provide a general mean x — 
the observed variation about it, viz.: V. 
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also obtain a separate mean for each group, e.g. X, = Lx,/n, for group 
G, and find the total variation within the group as 

yx, = Ex? — (Ex;)*/n, 
The total observed variation within the groups may be defined as 

yx = VXa + YXy +... VXg 

And an analysis of the variance of X could be carried out in these terms 
to test whether the group means differ from one another significantly. 
Vx would be broken down into the two sub-totals 

yx, within groups, and 

Vx-yx, between groups. 
And hence the mean square variances and the variance ratio would be 


obtained. 
Now suppose a second variable, Y is also measured in the same cases. 


We can obtain a general mean, y, a set of group means, Jo Y» ^s. Pos 
and break down the total observed variation in Y, that is Vy, into the 


two sub-totals, 
vy, within groups, and 
Vy-vy, between groups. 

The covariance of X with Y can also be treated in the same way. The 
sum of the products of the measurements can be obtained for each 
group, e.g. Lxy, for group g; then the general sum 

Ixy = Exya + Exp. Ixy, 
The total observed covariance in X and Y is obtained as 
Wxy = Exy — Ex.Ey/N 


The total within group g as 
wxy, = Exy, — Ex,Eyo/ro 
Finally Wxy is broken down into the two sub-totals wxy = WXYo 
-Fowxys +... wxy,, within groups and Wxy-wxy, between groups. 
(Some of the terms in these expressions may be negative.) 
Extending the procedure to the m variables, 


x! = [xo Xa- +- Xml 
we obtain firstly g sets of m means, one for each group, e.g. 
Xs = [ša Šp- Fam] 


the first denoting the group, the 


(the terms here have two subscripts, £ p, th 
ance-covariance matrices: 


second the test), and next three m x m vari 


S, the total, where the terms in the leading diagonal are such as Vx, 


and the rest such as Wxy; 
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T, within groups, where the terms in the leading diagonal are such as 
yx, and those elsewhere such as wxy; and 


U, between groups = S — T 


Both S and T may be used to form a matrix of correlations; and it will 
be convenient to distinguish between the two, as R, and Ri. 


2. PROPERTIES OF THE THREE DISPERSIONS 


In Fig. 7.1 there are four partly overlapping ellipses, A, B, C and D, 
one centred in each of the quadrants formed by the intersection of OX 


Y 


FIG, 7.1. 


and OY. Each exhibits a distribution of the same form; together they 
constitute a composite distribution with its centre at O. If we had 
simply this composite distribution before us, with no information for 
sub-classifying the N cases it contains, we might be satisfied with treat- 
ing it as approximately uniform and elliptical, and use the variance- 
covariance matrix S to specify it. 

Then later, if information of this kind came into our possession, We 
might at first doubt whether distinguishing between the constituent 
groups 1s important, in view of the extent to Which they overlap. Such 
doubts may appear more reasonable if we examine the composition of 
the distributions first in X and afterwards in Y, instead of attempting to 
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view it in X and Y simultaneously: A is not distinguishable from D in 
X, nor B from C; conversely in Y A is not distinguishable from B nor 
C from D. 

To investigate such possibilities systematically we must extract the 
matrix U. Just as the pair of measurements obtained in a single case 
specifies one point in the 2-space, the mean value of x and mean y fora 
particular group specifies one bivariate mean point. When the number 
of measurements increases to m the space becomes m-dimensional but 
the set of means for a particular group still specifies only one multi- 
variate mean point, just as the set of measurements for an individual 
specifies a single point. The n, points defined by the measurements 
obtained from the individuals in group A form an ellipsoidal cluster 
about the multivariate mean point X';, the 7% points from group B 
cluster around 3^, etc. Moreover the g points specified by the group 
means X', X... X', form an ellipsoidal cluster about their general 
mean x’, which is taken for convenience as the point of origin. The form 
of their dispersion is specified by the variance-covariance matrix 
between groups, U. . 

Before considering the properties of U further, let us examine those 
of T. When the dispersion between groups has been extracted the 
distance between the multivariate mean points disappears. Under such 
conditions the four ellipses in Fig. 7.1 must become superimposed on 
one another with their centres all coinciding at the origin O. The 
residual dispersion, T = S — U, forms a single ellipsoid representing 
the dispersions within the groups collectively, provided that they all 
have approximately the same form. A method of testing the validity of 
such a postulate is given by Kendall (1957. — ~ 

Returning to U, we need to note that there is an upper limit to the 
number of dimensions that can be occupied by the dispersion of the 
group means X^, ¥’,. . . X; about their general mean. When m > g 
it is restricted to g — 1 dimensions at most, for any two points can be 
connected by a straight line, any three points by a plane, etc. (The 
general mean <’ is a linear dependent junction of the group means so 
must be coplanar with them.) When g > m the dispersion may extend 
into all m dimensions, but of course no further. 

To distinguish between the members of any two groups as accurately 
as possible we need to find the axis along which the distance between the 
two mean points is greatest, not in absolute magnitude but relatively rà 
the variation within the groups. The discriminant function already 
described defines this axis as the y vector. Let us say Yan 1 the vector 


required for distinguishing between the members of groups A m B. 
Another vector Yas serves similarly to distinguish between the members 
h between the members 


of groups A and C, a third, Y» to distinguis 
of groups B and C, etc. So with g groups In all g (g—1)/2 vectors are 
obtained, each describing a different axis. 
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All these axes must lie in a space of not more than g—lorm 
dimensions, whichever is the lesser, and may perhaps be confined to a 
space of even fewer dimensions. Without noticing that we had done so, 
we might perhaps compute g (g — 1)/2 vectors which all lie approxi- 
mately in a space of two dimensions and even do not diverge greatly 
in the second (though it would be unlikely to escape notice if they all 
tend to coincide along a single dimension). 4 finding that all the y 
vectors are located in a sub-space of relatively few dimensions might be 
of considerable theoretical interest in certain contexts. 


FIG. 7.2. 


simultaneously to the same extent, and its major and minor axes become 
apparent. Its major axis now defines the dimension in which the varia- 
tion between the groups is greatest compared with the variation within 
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In general an m-dimensional T is scaled down so that its principal 
axes are all of equal magnitude when it is post-multiplied, or pre-multi- 
plied, by its reciprocal, T-1. U is reduced proportionately to UT~’. Its 
latent roots and vectors are now defined algebraically by the equation 


[UT-1]x = Ax 


The first latent vector obtained by the arithmetic solution of a particular 
equation of this kind is the dimension where the variation between the 
groups is largest relatively to the variation within the groups, the 
second the dimension orthogonal to the first where variation is rela- 
tively next largest, etc. Let us denote these vectors as the 6 vectors, the 
latent roots as à, ó; . . . etc., and the space they define as the D-space. 
Have we any reasons for expecting that the latent vectors of UT}, 
i.e. the ô vectors, will coincide with those of U or T or S=T + U,i.e. 
the / vectors obtained by principal component analysis of one of the 
latter matrices, or with the f-vectors obtained by some other method of 
factor analysis? There is a coincidence in Figs. 7.1 and 7.2 which is 
perhaps worth examining. It can be shown in general that the latent 
vectors of T-! are the same as those of T, and its latent roots are the 
reciprocals of those of T. So the vector of the smallest root of T is that 
of the largest root of T-!, etc. Now in Fig. 7.1 the dispersion of the 
mean points is circular, so the latent roots of UT- in Fig. 7.2 are the 
same as those of T in Fig. 7.1, but the relative magnitude of the roots 
is reversed. . . 
For an illustration of another kind, consider the effect of including 
in the experiment one measurement in which there is some variation 
between groups, however small, and none whatever within groups. 
Contracting T in the manner described will eliminate all the other 
variables, no matter how spectacular their variances, and leave this one 
sufficient by itself to effect accurate discrimination. T 
Evidently it is possible to excogitate a number of specific instaness 
when simpler or more complicated relationships can be demonstrate 
between the latent roots and vectors of UT- and those of U, T or S. 
But no universal mathematical relationship can be found. Mathematical 
considerations, that is to say, do not suggest that the ratio of the 
variance between groups to the variance within or the total is likely to 
be greater in dimensions where the latter variances are large than where 
they are small. Nor does any such expectation follow from the null 
hypothesis. There remains, however, the possibility that psychological 
arguments may indicate expectations in some contexts. : " 
The possibility that the matrices obtained in a particular ane 
have the same latent vectors is quite simple to check. S, the sum o. 
and T, and UT-}, the ratio of U to T, cannot have the same latent 
vectors as each other, or as U or T, unless U and T have the same 
vectors as one another. Now the product of two symmetrical square 
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matrices is itself symmetrical when and only when the vectors of the two 
are the same. So we have only to inspect the product matrix UT-! and 
see if it is symmetrical, to discover whether U and T have the same 
latent vectors. We must presume that in general they have not, although 
admitting that there may be some instances where the vectors coincide 
approximately or even exactly. 


3. AN EXAMPLE, ILLUSTRATING THE COMPUTATIONS 


The data used here for example come from the experiment described in 
another chapter by R. Payne. They have been reduced to three measure- 
ments on five groups of subjects. Table 7.1 gives the composition, 


TABLE 7.1 
No. of Mean in Variables 
Group Cascs I Hi 
1. Normals 20 7:8513 15-6859  —4-1676 
2. Dysthymics 12 149372 16-0880 —4-0299 
3. Hysterics 8 13:4677 15-9608 —5-7374 
4. Depressives 20 239145 164368  —1:9892 
5. Schizophrenics 20 26:7674 20-2694  —5:8713 
Sub-totals 
2 + 3 All neurotics 20 14-3494 16-0371 —4-7129 
1 +2 + 3 Neurotics and Normals 40 — 11-1004 158615  —44402 
4 - 5 Psychotics 40 — 253410 183531 —3-9302 
Grand Total 80 — 182206 171073 —41852 
TABLE 7.2 
Dispersion within groups, T 
I IL Ill 
I 9004-3515 —571:8267 761-3268 
II —571-8276 1853-0493 —413-4285 
ul 761-3268 —413-4285 954-3343 
TABLE 7.3 
Dispersion between groups, U 
I Ir Hi 
I 4569-9025 842-6949 11-1115 
Il 842-6949 2712-3667 — 124-2455 
Tl 11-1115 —124-2455 1728772 


numbers and mean measurements of the groups, Table 7.2 the total 
dispersion within groups, which provides an example of T with 75 
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degrees of freedom, and Table 7.3 the dispersion between groups as an 
example of U with 4 d.f. 

A convenient way of finding the reciprocal of T is to resolve it first 
into the product of a triangular matrix and its transpose. Writing out 
the equation LL’ = T in terms of its elements as 


a 0 a b f tie hs 
r c 0 | c e|— E ton tag 
d é f. 0 0 f. fi tes t 


we find the elements of L from 


ty =a" a= Vt 
fia = ab <. b = t/a 
tı = ad <. d = t/a 
tog = b + c* 4, C= etc: 


Thus a simple routine is to set L under T and compute its entries column 


by column beginning at the top left-hand corner. : 
Next the reciprocal of L is found. Writing out the equation LL- =I 


a 0 « 0 10 
E:3 23-513 
defi lõ egl lool 


we find the elements of L^! from 


as 


ax = 1 pe ia 
=i Jylc 
tzi $= 1/f 
ba + cB =0 <. B = —ba/c 
ey +fe=0 ..e=elc. 


L-! is set under L as before, but work is begun down the leading 
diagonal, continues down the first off-diagonal, etc. The L for the 
example is given in Table TA, Lin Table 7.5 and T7}, which appears 
in Table 7.6, is computed as (L7 L~}, i.e. L- premultiplied by its 


transpose. 
TABLE 7.4 
L, where LL’ = T 
I Il It 
I 94-89126 00 00 
II —6:02614 42:62317 0-0 
il 8:02315 —8-56529 28-57620 
263 


P. Slater 


TABLE 7.5 
L-1 
I Il m 
I 0-01053838 0-0 0-0 


II 0-00148994 — 0-02346142 0-0 
IN —0:00251220 0-00703221 0:03499416 


TABLE 7.6 
T 
I II Il 
I 0-0001 195885 0:0000172898 —0:0000879123 
II 0-0000172898 0-0005998902 0-0002460863 
IN —0-0000879123 0-0002460863 0:0012245912 
TABLE 7.7 
Ur 
T II HI 
I 0:56010097 0-58727150 —0:18076792 
II 011640849 0:14738503 —0-15920743 
Hi —0:01601740 —0:03179883 0-18015194 


Hence we can obtain UT-1, which is conspicuously asymmetrical, as 
appears in Table 7.7. To simplify the work of obtaining its latent roots 
and vectors, we may substitute a symmetrical matrix for it and then 


apply the computing methods already mentioned in Chapter 6, Sect. 1. 
Putting the characteristic equation 


[U — 2TIx =0 
into the form Ux = ATx and substituting T = LL: 
Ux = ALL’x 
For any non-zero 4 we may premultiply both sides by L~ and obtain 


L^!Ux = AL'x. Now if we Substitute L’x = z so that x — (L)^1z, we 
find 


we have 


L-U(L)-1 = 37 


So the matrix L-1U(L’)-1, which is Symmetrical, has the same latent 
roots as UT—!; and the latent vectors of UT-1, viz, x, can be obtained 
from x = (L')-!z. Note that (Lt = (L-1. 

The example of L-!U(I/)-1 is given in Table 7.8. Its latent roots, 
which are also the latent roots of UT™1, are given in Table 7.9, to- 
gether with its latent vectors. The latent vectors of UT-1 appear in 
Table 7.10. 
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TABLE 7.8 
CUY" 
I Il IH 
I 0:50752173 0-28010698 —0:05443773 
II 0-28010698 0-21898053 —0:11443532 
Il —0-05443773 —0-11443532 0:16113579 
TABLE 7.9 
Latent roots and vectors of L- U (L7 
a a ai 
0-699512 Dres 0-011842 
1 a 3 
I 0-691013 0-167797 — 0-043209 
II 0:441802 —0:127876 0-086250 
Il —0:163779 0:363010 0-050356 
Note: The vectors / are given in the scale // = 2. 
TABLE 7.10 
First two latent vectors of UT, normalized 
d ð, 
I 0-609948 0:052309 
I 0-672879 —0:035149 
III —0:418566 0-998012 


Comparing the mean square variances within groups with the m.s.v.'s 
- P groups for the three variables, given in Tables 7.2 and 7.3, we 
nd 


for Variable 
II lll 

m.s.v. within 120-06 24-71 12:72 d.f. 

m.s.v. between 1142:48 68:09 43:22 75 

Variance ratio 9-516 2-756 3-397 4 

Probability «0001 <0-05 «0:025 


ences between the groups in all three 


measurements. But the dispersion between groups is not thereby shown 
to be 3-dimensional: it could conceivably be located in two dimensions 
only, or even only in one. The tests of significance are inconsistent with 
the possibility that all the group means are clustered together too closely 
to be distinguishable from the general mean, but not inconsistent with 
the possibility that they are strung out alonga straight line in the 3-space. 

The canonical analysis applied to the example of UT-1 in Table T 
shows that this alternative, of collinearity, is unacceptable. The first two 
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latent roots are both found to be significant. For a general test of the 
significance of the latent roots of any UT? we may enter 


(N-g) 4, as a 7? with m + g — 2 d.f. 
(N-g) A, .. w m+g—4df. 
etc. Thus for the latent roots given in Table 7.9 we obtain 
Xa for 4, = 52-46 with 6 d.f., P. < 0-001 
à = 13-22 4 < 0-025 
à,— 089 2 > 0-75 
The dispersion between the groups is thus found to be located almost 
wholly on a plane in the 3-space defined by the vectors 6, and 6, and 
illustrated graphically in Fig. 7-3. (The co-ordinates of the group means 
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HG. 7.3. Principal Components of the Dispersion between Groups. 
Group means: 


Group ô, 5, 
Dp. Depressives 2648 E 22. 
Ds. Dysthymics 21:62 —3:81 
H. Hysterics 21:36 —5:58 
N. Normals 17-09 —4:30 
S. Schizophrenics 3242 —17 
Axes pass through the general mean 24-38 —3-82 


are obtained by post-multiplying the Table 7.1 means by the Table 7. 10 
vectors). Using a break-down of the sum of the latent roots for descrip- 
tive purposes we might say that 78:8% of the variance between groups 
occurs in the dimension defined by 6,, 19:9% in ô, and only 1:3% 
elsewhere. The main features of the dispersion are that the normals and 
the two neurotic groups, hysterics and dysthymics, are centred fairly 
closely together and that the two psychotic Broups, the depressives an 
the schizophrenics, are well separated both from the non-psychotic 
groups and from one another. 
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4, OTHER FEATURES OF THE EXAMPLE 


Much the same conclusions can be drawn from an examination of the 
discriminant functions. To compare firstly the two neurotics groups we 
set down the mean differences obtained from Table 7.1 and premultiply 
by T-1 (Table 7-6) to arrive at the discriminant function, viz.: 


9 Mean Discriminant 
differences function 
o) 
I 1-4695 0-00002782 
I 0:1272 0-00052191 
il 1-7075 0:00199310 


This computation gives the elements of y on a scale suitable for the next 
step: d'y — D, which leads to D — 0:00351050 here. The test of signi- 


ficance then takes the form 
Source of variance Sumofsqrs. df. 


i nan - 
Difference between group means { ot } D* m 


Na + No 
Variance within groups D N-m-g 


(Where n, and 7, are the numbers in the groups compared). And the 
mean square variances are found to be 000001972 between groups and 


0:00004876 within. 
_ Concluding that the variables 
inate between these two groups we 


proceed to the next comparison, o : 
Means for all neurotics are listed in Table 7.1. Proceeding as before we 


obtain D — 0-00609547 for this comparison, m.s.v.’s = 0-00012385 and 
0-00008466 respectively, and thus again find no significant difference. 

We should not be mistaken, now, in concluding that there are only 
three well-established points to consider in the dispersion between 
groups: the mean points for the endogenous depressives and the 
schizophrenics and a combined mean for normals and neurotics 
together. There is thus no difficulty in understanding why the dispersion 
between groups is located on a plane. B 

In looking for axes to which to refer the variation between these 
means we are not restricted to the use of 61, and 6,. We may perhaps 
allow psychological or administrative considerations if they are suti- 
ciently cogent to persuade us of the advantages of using another pair. 
Using a discriminant function again to compare the schizophrenics 


with the depressives we have 


under consideration do not discrim- 
may combine them together and 
f the neurotics with the normals. 


d and hence Yı 
—2-8529 —0-00074872 
—3-8326 —0-00139313 

3-8821 0:00406164 


267 


P. Slater 


D = 0-02324305 and the m.s.v.’s = 0-00180080 and 0-00032282 respect- 
ively. The variance ratio is 5-578, and P < 0-01. And comparing the 
psychotics with the others 


d and hence Va 
—14-2406 0:00170126 
—2:4916 0:00186641 
—0:5100 —0-00061423 


Here the variance ratio is 13-858 and P < 0-001. When these two vectors 
are normalized they become 


n Ya 
I —0:1718 0:6736 
II —0:3196 0:7390 
m 0:9318 —0-0056 


The angle they subtend is approximately 70° (y’, y, = cos 0 = 0:3572, 
6 = 69° 10’). They are the axes preferred by Payne. 

A few supplementary details need perhaps to be added to the informa- 
tion about the variables given by Payne. The original observations from 
which they are derived are scores on a battery of 54 tests. The sums of 
squares and products of deviations from the general means for all 
N = 80 cases were scaled down to a 54 x 54 matrix of correlations, Rs. 
A centroid factor analysis was applied, with estimated communalities 
substituted for the units in the leading diagonal of R,, and three factors 


extracted. The unrotated loadings form a matrix, f, with 54 rows and 
3 columns. 


The three variables are the factor scores for these three factors. For 
the purposes of the analysis it is unnecessary to compute them for each 
case individually; we only need the data in Tables 7.1 to 7.3, relating to 
group means and dispersions. Using the definitions already given 

x, for an individual's score in test X 

X for the general mean in all N cases 

Vx = X(x, — ¥)?, the sum of squares of deviations from X, 
let us write 


5$, — x/ V Vx/(N — 1) 
and 
$— X/VVNx/(N — I) 


so that the individual's standardized score in X can be expressed as 
s; — $,1e.asa difference between two terms, and the row vector of his 
standardized scores in the 54 tests as s'; — 5’, i.e. as a difference between 


two vectors, one referring to him personally and one to the general 
mean. Analogously 


5, = X,/V Nx/(N — 1) 
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gives the characteristic element in the row vector 5’, of means for 
group G. 

The three factor scores for an individual could now be found, if 
required, from s’; f; but the five rows of group meansin Table 7.1 can also 
be obtained directly as 5’,f, and the final row of means for all 80 cases 
as 'f. The total dispersion of the factor scores between groups, which is 
given in Table 7.3, can be computed as 

U = En,f'5,.5',f — Nf SES 
That is to say, each row of Table 7.1 is premultiplied by its transpose to 
forma3 x 3 matrix, the elements of which are multiplied by the number 
of cases in the group. The 5 matrices so obtained are summed. Lastly a 
similar matrix formed from "f is subtracted. The total dispersion within 
groups given in Table 7.2 is obtained as the difference 


T=S-—U 


after S, the total dispersion about the multivariate mean for all groups 
combined has been found as 


S = (N — Df'R,f. 


5. RATIONALE 


A case for employing the analysis described in the two foregoing 
sections can be presented without postulating that the f-vectors (or any 
others derivable from them by rotation) have any particular psycho- 
logical connotation. 

Since there are only 80 cases to allocate and 54 scores are recorded for 
each, it is almost a foregone conclusion that a set of linear combinations 
of the scores can be found to provide an accurate allocation procedure. 
The immense labour of the computations will lead to a trivial result, for 
however satisfactory it appears to be it will not be safe to generalize. 
But to show that a greatly reduced number of measurements will serve 
the same purpose may be of interest; so some preliminary procedure is 
Worth considering for reducing the scores to an appropriately small 
number of variables. 

_ The centroid factor analysis serves this purpose as a rough approxima- 
tion to a principal component analysis, cf. Burt (1940). The three columns 
of f, treated as vectors, define approximately the first three principal 
axes of R,—the three orthogonal dimensions of variation where differ- 
ences between individuals are most conspicuous and confirmed most 
reliably by the cumulative evidence of all the tests. If the search for 
evidence of discrimination is to be restricted to only three out of the 54 
dimensions where it may be found, these have a claim to priority. The 
claim would have been better of course if the principal components had 
been located exactly; and the subsequent computations would have 
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been simplified as exact orthogonality would have been obtained; but 
as a matter of historical fact a centroid analysis was used. 

The canonical analysis demonstrates that the group means form a 
two-dimensional scatter as shown in Fig. 7.5. Purely from considerations 
of administrative expediency we might prefer to substitute two other 
axes for the principal axes of this dispersion, and employ firstly a 
measure for distinguishing psychotics from nen-psychotics, viz. yı, and 
secondly when applicable a measure for subclassifying the psychotics 
into schizophrenics and depressives, y+. The successive operations in- 


volved in obtaining the two measures for an individual are defined 
algebraically as 


(5; — Sy 
Consequently the product matrix fy (cf. Payne’s chapter) defines the 
two required vectors in the original m-space. 

In proceeding further to consider the relevance of the evidence to the 
psychological theories proposed by Payne several points seem worth 
noting. The tests were originally selected to ensure that their cumulative 
evidence would provide reliable measures of two psychological traits, 
one believed to distinguish psychotics from non-psychotics and the other 
schizophrenics from depressives. A search restricted to the first three 
axes of R, therefore seems sufficient for the purpose of investigating the 
theory. Support for it comes from the findings that the constellation of 
the group meansis restricted to two dimensions within the three investi- 
gated, and Separates the groups in the expected manner—without, for 
instance, discriminating between neurotics and normals. Further sup- 
port is obtainable from a detailed examination of the elements in fy» 
Which tends to show that the weights assigned to the tests for estimating 
individual differences in the discriminant dimensions accord with their 
postulated relevance to the two psychological traits. 

However, it should not be forgotten that the search for discriminance 


has been very incomplete. Only thr t i i been 
explored. It would thus be weil = pa ils spa 


: 1 to avoid drawin. i restrictive 
conclusions, such as that the tests serve to haie em Has than 
the two described, or that they provide no way of distinguishing hyster- 
ics from dysthymics. We cannot even be sure that better discriminants 
for the proposed purposes do not occur in some other sub-space which 
has not been investigated. The vectors found are lines of best fit only 
within the chosen sub-space; they are optima in a very restricted sense. 

The danger that valuable evidence may be overlooked as a result of 
an incomplete factor analysis will be illustrated in Chapter 8. 
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1. HILDEBRAND’S EXPERIMENT 


THE data now to be examined were collected by H. P. Hildebrand 
during a course of studies for a Ph.D. at this Institute. His findings, 
based mainly on a centroid factor analysis, were published first in his 
thesis (1953) and later in an article in the British Journal of Psychology 
(1958). Eysenck has quoted them extensively (e.g. 1957) and asked me to 
make some further analyses of parts of the data, using the methods 
described in Part II. Most of this was done during 1956-7; the electronic 
computation by the National Physical Laboratory under the supervision 
of Mr. J. G. Hayes. L. H. Storms, who was pursuing a course in statistics 
here at that time, used other parts of the data not too bulky to be treated 
by electric calculator for a parallel analysis, and published his findings 
in the J. Ment, Sci. in 1958. Anyone who goes to the trouble of 
reading all three texts is likely to find differences in the authors’ inter- 
Pretations of the results and may even come to the opinion that the 
different methods of analysis conflict. In this account I shall attempt to 
show where the differences originate and suggest a synthesis. 
Hildebrand’s research exemplifies the hypothetico-deductive method. 
To understand what he did one must know what postulates he had in 
mind and how he proposed to test them. He supposes that introversion- 
extraversion (hereafter denoted I-E for short) is a major dimension of 
personality, and follows Jung (1924) in considering that it occurs 1n its 
entire range among normal subjects. He does not accept Freud s 
Opinion (1920) that neuroticism is associated with introversion and 
normality with extraversion. He adopts Eysenck’s view that the 
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distinction between neurotics and normals is to be found ina dimension 
of neuroticism which is orthogonal to I-E, and follows him further in 
supposing that I-E serves to subclassify neurotics though it does not 
distinguish them collectively from normals. The anxiety states are ex- 
pected to occur preponderantly among introverts and the hysterics 
among extraverts. 


His experimental groups consisted of a 

No. of cases Diagnosis 
25 Anxiety states 
10 Obsessional neuroses 
10 Reactive depressions 
25 Conversion hysterics 
20 Psychopathic states 
55 Mixed neuroses 
25 Normals 

170 Total 


The neurotic subjects were in-patients at Belmont Hospital, Sutton, 
Surrey. They were all males, aged between eighteen and fifty, with IQ's 
above 90, showing no signs of organic lesion or psychotic illness, and 
not undergoing physical treatment by ECT or insulin therapy. The 
‘mixed neuroses’ were cases where the diagnosis was not clear-cut. The 
control group of twenty-five normals (the H-controls) was drawn from 
soldiers passing through an Army Reallocation Centre in the neighbour- 
hood. This selection may have been unfortunate. Reallocation is pro- 
vided by the Army for men found unsuitable for the duties to which 
they were posted as new recruits, and the centres are places where all 
sorts of misfits collect. Eysenck replaced this group later by a sample of 
twenty-two men (the E-controls) mostly working-class people such as 


the neurotics, who were contacted individually and invited to attend 
the Institute for testing—none refused. 

The original test programme was to have yielded thirty-two measure- 
ments in each case, but some omissions and some amalgamations of 


the experimenter. Self-rating scales are thus included, There were three 
measures of intelligence: Progressive Matrices, the Mill Hill Vocabulary 
and the Nufferno Power test; four of neuroticism: the Maudsley 
Medical Questionnaire and the lie scale, an Annoyances test based on 
the Pressey X-O scale, and the combined D and C scales from the 
Guilford-Martin inventory; the remaining fifteen were expected to 
exhibit variation in I-E. They are enumerated in full in Table 8.1. 
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TABLE 8.1 
Data extracted from Hi ildebrand (1953 and 1958) 
Variance Factor loadings 
Test Ratio N I-E 
A. Also administered to the second control 
group 2 
Mill Hill Vocabulary 1-765 — 0-039 0:110 
Maudsley Medical Questionnaire 11154 0:739  —0:257 
Lie scale 2:665* —0:380 —0:371 
S 1-448 0739  —0472 
ES scales from the Guilford-Martin 1:776 0:467 0:099 
Inventory (D and C scales totalled 1-185 , j 
Cļin the re analysis) 17719. 0831 0057 
R 3.944** — —0:246 0-563 
Annoyances 2:131 0.429 | —0012 
Weighted wrong Directions and 1:642 0:257 0:360 
Crossed Lines from Porteus Maze Test 1-800 0:067 0:425 
B. Not administered to the E-controls 
Progressive Matrices 1:084 0179  —0014 
Nufferno Power test 1:080 0:098 0:012 
Sex Humour 0:916 0:301 0-222 
Static Ataxia, Total distance 1:153 0:086 0:180 
3 » Reversals 1:011 —0:041 0:073 
Suggestibility, Total distance 2-414* 0-176 0-052 
» Reversals 3-936** 0-021 0211 
Body build Index 0:570 0-015 —0:255 
Starting time 2:251 —0:167 —0:195 
Total time from the Porteus 1246 —0062  —007 
Lifted pencils Maze test 1-029 0070 0-006 
Leg persistence 2-791* 0113 —0-066 
C. Not included in the factor analysis 
Extrapunitive humour 1:108 
Incongruous humour 0-953 
Food aversion 0:251 
Aesthetic differences 1-266 
Static Ataxia, Forward distance 1:945 
» » Backward ,, 1:039 
Suggestibility, Forward — ,, 2:007 
» Backward ,, (n.s.) 
Wavy Lines (Mazes) 1:570 
Note; The variance ratio tests the significance of the differences between the means of the 
neurotic groups. *0:05 < P < 0:01 **P «001 


Hildebrand's argument leads him to expect that the dimensions of I-E 
and neuroticism can be located by a factor analysis restricted to the 
data remaining after the anxiety states, the hysterics and the normals 
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have been excluded to provide criterion groups. Then when factor 
scores for I-E and neuroticism are calculated for everyone it should be 
found that the means for anxiety states and hysterias differ significantly 
on the I-E scale, where the normals are to be found in a central position 
with the mixed neurotics; while on the neuroticism scale the normals 
should be separated from the other groups. 

The correlations computed from the restricted data were subjected to 
a centroid analysis to obtain first estimates of the factor loadings for 
three factors. Two iterations were carried out by Lawley’s maximum 
likelihood method (1950); and the resulting residuals were tested for 
significance by Bartlett's test (1950), and found to be still significant. A 
fourth factor was accordingly extracted. On the basis of this evidence 
Hildebrand concludes that all four factors are significant. Two rotations, 
he adds, were sufficient to adjust the factors to psychologically meaning- 
ful positions. The resultant loadings for the first and third factors are 
reproduced in Table 8.1. 

The four factors together account for 32% of the total variance. The 
first accounted for 12% and was, to quote, ‘easily identified as a neuro- 
ticism factor. The four variables with the highest loadings differentiated 
significantly between normals and neurotics. Moreover, previous studies 
had shown these tests to be highly saturated with neuroticism, emotional 
instability or similar factors. The second accounted for 9%. It was 
confidently identified as an intelligence factor. The first three variables 
were the three tests of intelligence, and there are high correlations with 
mistakes and length of time on the Porteus Maze test. The third 
accounted for 6%. Critical ratios were calculated for the scores of the 
pe highly loaded of the variables between the hysterics and the 
: rds da a = B discriminate between the groups.’ (The 
the WWD and X lines sc : and R scales, Suggestibility reversals, an 
el i mae Tes on Porteus.) ‘For these reasons the factor 

de -E factor. The fourth factor extracted was identified 
rin from the intercorrelations of the static ataxia and suggesti- 
inc aon acm Pesto ni i more mening 
I-E and neuroticism em n Tere then ape puted fonema nE n 
ipd » and the scores were calculated for every 
individual.! The group means were found to be 


Group Neuroticism LE 
Anxiety states 5:949 1:407 
Obsessional neuroses 5:424 1:082 
Reactive depressions 4-647 1:295 
Conversion hysterias 4:866 2241 
Psychopathic states 5:916 2:637 
Mixed neuroses 5:364 1-593 
H-control normals 3.402 2:875 


1 This part of the work, which is exceedingly laborious and exacting, was done by 
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Hildebrand claims that apart from the position of the normal group at 
the extravert end of the I-E continuum, ‘inspection of the means 
demonstrates clearly that the distribution follows the prediction’. He 
points out that the representativeness of his control group is open to 
question, and remarks, ‘It is possible that if a less homogeneous and 
more representative civilian sample had been tested the predictions 
would have been fulfilled.’ 

He is, however, quite reserved in his appraisal of the importance of 
the I-E factor. ‘The experiment was purely on a descriptive basis,’ he 
comments. ‘Whether or not the factors found, particularly the I-E 
factor, may be considered to have hypothetical causal instances! 
underlying them is not a question that can be answered on the basis 
of the data here presented. A certain amount of caution must be used 
before the factor can be used as a basic postulate in personality research. 
Although the experiment was designed to amplify the factor as much 
as possible, yet the total variances it accounted for was only 6%, only 
1% more than that of a factor found impossible to identify satis- 
factorily.’ 


2. EYSENCK’S USE OF THE FINDINGS, AND HIS REASONS FOR 
TAKING A SECOND CONTROL GROUP 


Eysenck (1947) goes much further. He describes extraversion-introver- 
sion as a type concept with the ‘highest level of generality’. He connects 
the concept with Hull’s law of reactive inhibition (1943) in a formulation 
linking personality theory with learning theory: ‘Individuals in whom 
reactive inhibition is generated quickly and strongly, and dissipated 
Slowly, are thereby predisposed to develop extraverted patterns of 
behaviour, and hysterico-psychopathic disorders in cases of neurotic 
breakdown; conversely individuals in whom it is generated slowly and 
weakly, and dissipated quickly, are thereby predisposed to develop 
introverted patterns of behaviour, and dysthymic disorders in cases of 
breakdown.’ He follows this up with a great array of evidence, which 
needs not be reviewed here. : 
To confirm the theory evidence would seem required showing 


(1) That a certain degree of I-E can be inferred in an individual from 
the consistency of his results in several relevant tests. ie 

(2) That a certain degree of liability to reactive inhibition can likewise 
be inferred from the consistency of his behaviour in various learn- 
ing situations. 


Miss le Prevost. Only the eight highest regression coefficients were used in computing 
the factor scores. 
The phrase is borrowed from Eysenck 1953a. 
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(3) That assessments of the two characteristics will be found related 
in a specified way. 

(4) That among neurotics both assessments will be found related to 
diagnostic differences in a specified way. 


Compared with other sources, particularly the psychoanalytic ones, 
which might be cited in support of the view that many aspects of 
behaviour reveal differences in I-E, Hildebrand's experiment has the 
advantage of specifying a weighted summation of scores on a particular 
set of ‘objective’ tests as appropriate for measuring it. Moreover, 
Hildebrand adduces evidence that these measurements distinguish 
between neurotics in the specified way. It is not surprising that Eysenck 
relies on Hildebrand's findings as his main support for the part of the 
theory concerning I-E as a personality trait. 

The greatest weakness of the findings from Eysenck's point of view 
is that they do not accord with the expectation about the mean score of 
normal people on the I-E scale. The H-controls’ mean places them at 
the extraverted end, beyond even the psychopaths, while the remaining 
groups lie fairly close together near the other end, and the difference 
between the anxiety states and the hysterics is not conspicuous. The 
variance between the groups could be broken down as follows 


Sums of 
Source squares d.f. 
Between Normals and Neurotics 26:47 1 
Between Psychopaths and other neurotics 17:81 1 
Among other neurotic groups 14-77 4 
Variance within groups, total 270-57 163 


And the variance among the neurotic groups excluding the psychopaths 
might accordingly be dismissed as non-significant (the F ratio being 
2:22). However, when the anxiety states are compared directly with the 
hysterics a significant difference is found on a one-tailed test in the 
direction required by the hypothesis, and this may be accepted as suffi- 
cient confirmation. 

Three explanations might be put forward to account for the anoma- 
lous mean score of the H-controls: the least disturbing is that the H- 
controls are not a representative sample of normal men. The alterna- 
tives are that the scale is not correctly identified as an I-E factor, or 
finally, if it is, that Jung's view of the frequency distribution of I-E in 
the normal population must be discarded in favour of Freud's. If a 
mean nearer the general average for all neurotics is obtained when 
another sample of controls is drawn from a less Suspect source, Eysenck 
argued, the new evidence will decide in favour of the first alternative 
and the original theory will be confirmed. This was his reason for taking 
the second sample, the E-controls, and giving them only the reduced 
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battery of tests (cf. Table 8-1) which would be sufficient, according to 
Hildebrand’s factor analysis, to provide reliable assessments of neuro- 
ticism and I-E. 

Factor scores calculated from the reduced battery are of course not 
exactly comparable with those from the complete battery, quoted above. 
To obtain comparable estimates for all the groups the procedure 
appropriate for the E-controls must be extended to the others. Table 
8.2, which gives the correlations between the tests, is completed by 
symmetry with units in the leading diagonal to form a matrix R. 
Neuroticism and I-E are treated as criteria, as explained in section 2, 
Chapter 6; their correlations with the tests (y) are provided by the 
factor loadings in Table 8.1. The values of x which satisfy the equation 


x-—Ry 


provide the partial regression coefficients, x — f, for estimating 
neuroticism and I-E, as defined by Hildebrand's vectors, from the test 
Scores on the reduced battery. They are found to be 


Test for Neuroticism for I-E 
MHV 0:0408 0:1400 
MMQ 0-1511 —0:1894 
Lie scale —0:1166 —0:2510 
Annoyances 0:0108 0:0156 
S 0:1906 —0:3355 
T —0:0271 0:0263 
D+€ 0-2985 0:2493 
R —0-0295 0:1842 
WWD 0:1238 0:1597 
X lines —0-0701 0:1062 
Multiple correlation 0:7548 0:7338 


The multiple correlations OPR, show that both estimates are reasonably 


tas The same statistics give the s.d.’s of the estimates within groups, 
ince e 


" »'B — FR 
Nera scores do not have to be calculated for each individual to find 
€ means for groups. The procedure described in section 4, Chapter 7, 
A be followed. The mean test scores for the groups are given in Table 
m Each column of entries constitutes a vector X,; division by the cor- 
‘€sponding entries in the column of s.d.'s gives 35; and the product 
Sap Elves the mean factor score, listed at the foot of the table. i 
" phe bivariate distribution of the means is shown graphically in Fig. 
:l. The axes are set through the general mean point for all neurotics, 
v coincides almost exactly with the mean point for the mixed neuro- 
LE The separation of the anxiety states from the hysterias along the 
75 axis, though not conspicuous, is in the right direction, the deviation 
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of the anxiety states towards the introverted end of the scale being more 
marked than that of the hysterias towards the extraverted end. 

When the corresponding graph given by Hildebrand is compared with 
it, the alteration in the location of the control group is the main differ- 
ence found. There, as already mentioned, the mean for the H-controls 
lies at the extraverted end of the I-E scale, beyond the mean of the 
psychopaths. Here the mean for the E-controls, while still well away 
from the means of the neurotics on the neuroticism scale, lies not far 
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riG. 8.1, The Distribution of Group Means for Neuroticism and I-E estimated me 
Scores on the Reduced Battery of Tests. The Axes are drawn through the Bivar! 
Mean Point for all Neurotic Cases (N = 1-744, I-E = 2646). 


from their general mean on the I-E scale. One cannot accept both groups 
as representative of normal men, and doubt falls on the H-controls i 
view of their suspect origin. The E-controls conform much more closely 
with the expectations deduced from the general theory than the 

controls. Therefore confirmation for the theory has been obtained. " 

In my opinion Hildebrand's investigation and Eysenck's continuatio 

of it combine to form a very competent as well as important rese 
the factor-analytic kind. It is not open to criticism except In 
extending to methods of centroid analysis in general. Their use 


method compares well with other investigators’. 
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As a result a weighted summation of scores on a battery of tests (i) is 
accepted as measuring I-E, (ii) is found to discriminate between neuro- 
tic groups. It is accordingly approved for use in further researches. If 
they show (iii) that it is also associated with liability to reactive inhibi- 
tion in the postulated way the outcome of the larger project to which 
they belong will be judged satisfactory on the whole. A connection in- 
directly linking differences in reactive inhibition with diagnostic differ- 
ences will have been formed and explained. The argument will not be 
invalidated even if evidence is adduced, now or later, to show that some 
other weighted summation of the same or different tests either provides 
a better measure of I-E or discriminates more efficiently between 
neurotic groups. 


3. THE DISCRIMINANCE OF THE TESTS 


Hildebrand has not convinced me that the vector he has chosen for 
measuring I-E is the optimum one for this purpose. But this comment 
is not a criticism of his treatment of the data. I know of no line of 
Teasoning which leads to an indisputable optimum for the vector defining 
a postulated factor such as I-E. I am therefore likewise debarred from 
Proposing any other vector as preferable. Eysenck’s acceptance of 
Hildebrand’s findings has established a consensus of opinion I can see 
no advantage in disturbing. On the contrary there is an advantage in 
having a scale for measuring I-E precisely even if arbitrarily defined; its 
Properties can then be examined in detail. 

When we enquire what test, or what linear combination of the tests is 
best for discriminating between the neurotic groups, we set out to search 
the data for evidence of quite a different kind. Although Hildebrand 
anticipated that his I-E factor, when identified, would be found to 
distinguish between the neurotic groups in a particular way, he did not 
Postulate, as far as I am aware, that it would provide the best weighted 
Summation of the tests for this purpose. So there is no cause for dis- 
appointment in finding that the merits of the LE scale are neither 
negligible nor outstanding in this respect. 

, In other words, we are now propounding a question he never con- 
Sidered, when we enquire what would be the effects of substituting his 
vectors for the y- or ó-vectors which would give the best results. All 
three define sets of weights for use in summation. When optimum 
Weights are used, a summation of two measurements must give a better 
estimate than either alone, and each additional measurement will 
piu the accuracy of the estimate even if only to a negligible extent. 
ben When unsuitable weights are used adding poorer measurements to 
em ones may produce a less accurate estimate than omitting them. 

€ f-loadings tend to have this effect. 

For discriminating the E-controls from the neurotics collectively the 
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best single test in the ten-test battery is the Maudsley Medical Question- 
naire. The second is the D plus C scale from the Guilford-Martin 
inventory. The neuroticism index, although it includes the MMQ and 
D plus C scales, comes third. The variance between controls and neuro- 
tics, expressed as a proportion of the total sum of squares, is 


29-895 on the MMQ ; 
16:7% ,, , D + C scale 
11-4% ,, ,  Neuroticism index 


On the other hand the discriminant function is significantly better than 
the MMQ alone; the proportion of its total sum of squares due to the 
variance between controls and neurotics is 38:195. Its vector is given 
in Table 8.4. 

The I-E index is more successful; in discriminating between the 
neurotic groups it comes a close second to the Guilford R scale. The 
variance between groups, expressed as a proportion of the sum of 
squares for all neurotics, is 


11-895 on the R scale 
10:195 ,, ,, I-E index 


If we take for comparison the discriminant function maximizing the 
difference between the anxiety states and the hysterias, we find the 
percentage only rises to 16:0; the increment over the discrimination 
provided by the R scale is not significant. Group means measured along 
this axis are 


Anxiety states —0:7337 
Obsessions 0:3832 
Depressions 0:0728 
Hysterias 0:6596 
Psychopathics 0:1777 
Mixed neuroses 0:0255 
E-controls ^ 0-5478 


(When the origin is taken at the mean for all neurotics and the s.d. 
within groups is equated to 1.) It is interesting to see that an axis can be 
found which places the anxiety states and the hysterias at opposite 
poles; and also that in this dimension the mean for the normal y e 
(the E-controls) lies well away from the mean for all neurotics and clos 
to the mean for the hysterias. 2s 
In the complete set of discriminant functions obtained by pem 
each group with every other the normal group is found significant'y 
different from all the rest. But the vectors for every comparison do no 
seem of sufficient general interest to be worth tabulating separately. : 
only vector relating to the normal group given in Table 8.4 is the Es 
already mentioned, which discriminates them from neurotics collectively- 
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The remaining results are summarized below: 


Obs. Dep. Hys. Psy. Mix. 
Anxiety states — — ES Tb — 
Obsessions — — — — 
Depressions — ++ — 
Hysterias " + — 
Psychopathics + 


Mixed neuroses 


Note: +, discrimination significant at the 5% level 
++, » » » 9 1% » 


Evidently the psychopaths form an isolated group of cases. They are 
distinguishable even from the hysterics. The obsessionals are the only 
cases not clearly distinguished from them; and this is not so much 
because of any close resemblance (cf. the constellation of group means 
in Figs. 8.2, 8.3 and 8.4) as because of the shortage of obsessional cases. 
Apart from the psychopaths, there is very little evidence that the remain- 
ing groups can be distinguished from one another. The only significant 
discriminant is the one already discussed, which distinguishes anxiety 
States from hysterias. There is of course no justification for any cate- 
Borical conclusion that any of these groups are indistinguishable. A 
theoretical possibility always remains open that some other tests dis- 
criminate, and in section 6 below it is shown that some of the tests 
omitted from the reduced battery actually do so. 

All the vectors giving significant discrimination between one group 
of neurotics and another are listed in Table 8.4 


4. THE CANONICAL AXES OF THE DISPERSION IN THE D-SPACE 


Since the normals and the psychopaths are the only isolated groups 
and the remaining groups tend to cluster together, we might be led to 
€xpect that the D-space between the group means will not be more than 
Wo -dimensional. The expectation is strengthened because the tests were 
Originally selected by Hildebrand to provide discrimination in only two 
dimensions, neuroticism and I-E, and were screened again on the 
evidence of his factor analysis and only retained if their loadings 
indicated that they acted as measures of either or both. The rather 
Surprising finding is that there are three significant independent dimen- 
Sions of variance between the group means. AN 

The latent roots and vectors of the dispersion are given 1n the first 
Part of Table 8.5. The tests of significance show that for 


ô, 4? = 103-32 with 15 d.f. 

à, 5, = 39:93 4, 13 » 

ôs , =2864 , Ils 
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A Re-examination of some Data 
TABLE 8.5 
Latent roots and vectors of the dispersion of the means in the D-space 


PART 1. ð, LH LH 
Latent roots 0-64577 0-24954 0-17903 
Variable Latent vectors, normalized 
1. MHV ^ 01759 0-3450 —0-1031 
2. MMQ 0-6616 —0-1960 0-4354 
3. Lie scale 0:0715 —0:4277 0:3086 
4. Annoyances —0:2081 0:4318 0:2592 
5. S — 0:2544 —0:2063 —0:2585 
6. T 0:0414 0:2101 0:0003 
7.D * C 0:2131 0:0594 —0:4710 
8.R 0:0212 0:4663 0:2096 
9. WWD 0:0710 0:3502 —0-1219 
10. X lines —0-1064 —0-2071 0:5384 
PART 2. 
Group means 
Group 
Anxieties 0:4778 —0:4614 —0-2731 
Obsessionals 0:2240 0:2714 —0:4364 
Depressives 0:1034 —0:8979 0:3266 
Hysterics 0:1237 —0-0802 0:6707 
Psychopaths 0:4469 1:0603 0:0889 
Mixed states 0:1317 —0:0598 —0:1671 
Normals — 1:5677 0:0859 —0:0650 


The co-ordinates of the group means in these three dimensions are 
given in the second part of the table, and their scatter is shown graphic- 
ally in Figs. 8.2, 8.3 and 8.4. ) 

On the whole the relationship between the groups shown in these 
Braphs is consistent with the psychological theories of Eysenck and 
Hildebrand. In the first dimension the normal group is contrasted with 
the neurotic groups, all huddled together at the opposite end of the scale. 
This is seen equally clearly in Figs. 8.2 and 8.3. Just such a relationship 
1$ ascribed by them to neuroticism. In the second and third dimensions, 


as Fig. 8.4 shows, the anxiety states and the hysterias are well separated. 


A line passing through the two mean points would be nearly parallel 
to the axis of the third dimension, and would exhibit a relationship 
between the group means of the kind they attribute to LE. — i 
The surprising feature of the dispersion is thus its second axis, which 
arrays the psychopaths and the depressives at opposite poles and the 
normal cases midway between them, with most of the neurotics on the 
epressive side of the normals. For this result no prior hypothesis 
was advanced and no special selection of tests was made. It is perhaps 
all the more interesting on that account. 
285 


P. Slater 


A possibility worth considering is that phenomena at present being 
ascribed to I-E are really heterogeneous and should be ascribed to two 


& 


Dep. 
FIG. 8.2. The Dispersion between Groups on its First and Second Vectors in the 
10-Test Battery. 


independent factors. In his theoretical formulation, quoted in section 2 
of Chapter 8 above, Eysenck explicitly associates hysteric with psycho- 
pathic disorders on the one hand, locating them at the extraverted en 


" 


FIG. 8.3. The Dispersion between Groups on its First and Third Vectors in the 


10-Test Battery. 


of the I-E continuum, and implicitly associates reactive depressions 

with anxiety states on the other as dysthymic, locating them at the 

introverted end. Fig. 8.4 shows two independent axes, one running from 
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anxiety across to hysteria and the other from psychopathy to depression. 
Thus it illustrates the relationships he anticipated but expands I-E into 
a two-dimensional continuum. 
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FIG. 8.4. The Dispersion between Groups on its Second and Third Vectors in the 
10-Test Battery. 


5. THE PROJECTION OF A D-SPACE ON AN f-SPACE 


In the previous section the vectors derived from the discriminatory 
analysis were compared with those derived by Hildebrand from his 
factor analysis. The discriminating power of the factor analysis proved, 
Dot surprisingly, inferior to that of the canonical analysis. The latter 
technique is developed specifically for maximizing discriminance, while 
the former, as I have explained, is designed and was employed for a 
different purpose. We may however enquire why the deficit is as large 
as it is. 

In a limited sense the canonical analysis is an optimal procedure, and 
extracts all the information obtainable from the data about the differ- 
ences between the groups. Assuming that the multi-dimensional dis- 
Persion within groups is normal and homogeneous all the relevant 
Information is contained in UT~}. All the differences between the groups 
discoverable in the m-space are exhibited in the D-space. Extracting the 
Principal components of the dispersion there completes a search of the 
D-space and therefore also of the m-space. M 

But precisely the same results could have been obtained if T had been 
Tesolved into its principal components first. Instead of being expressed 
directly as derived functions of the test scores the canonical axes would 
then have been expressed as derived functions of the principal com- 
Ponents, themselves derived functions of the test scores. The procedure 
Would have been unnecessarily circuitous and laborious but could 

Sen carried through without any sacrifice of accuracy, and a complete 
Search would have been made. 
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Accuracy is lost inevitably at two stages in the course of a factor 
analysis of the kind employed by Hildebrand. The first loss occurs when 
the common factors are extracted for consideration and the specific fac- 
tor variance is discarded as negligible. The assumption that at some 
stage the residuals become insignificant is so widely prevalent that no 
one can be blamed for accepting it uncritically. Nevertheless it is open 
to criticism. For instance Kendall’s comment (1957) is pertinent: ‘No 
non-zero / requires any test of significance in this sense: a sample with 
d effective dimensions cannot arise from a population with fewer than 
d dimensions. However small a value of 4 may be, therefore, it cannot 
arise from a parent value of zero. It may be negligible but it is always 
significant.’ I have published examples (1953) where the size of the 
smallest component of a dispersion is subject to experimental control 
and cannot be neglected. And the specific variance Hildebrand excludes 
from consideration is not just a small proportion; according to his 
figures it is 68% of the total. 

If this were all, the search would have been restricted to the f-space 
defined by the unrotated vectors of the first four centroid factors, and 
the part of the D-space confined within this f-space could have been 
explored in its entirety. But a further loss is incurred when rotation 1S 
carried out and two of the rotated vectors are discarded as measuring 
traits lacking interest from the point of view of the psychological hypo- 
thesis. Search is consequently restricted to a reduced f-space of two 
dimensions, and only the cross-section of the D-space confined within 
it is discernible. 

The results would be interesting if loss of this second kind could be 
excluded by some method of analysis, and loss of the first kind, which 
is incurred inevitably when search is restricted to any f-dimensional sub- 
space in an m-dimensional continuum, could be isolated and measurec: 
But I cannot suggest any comprehensive method of analysis which WI 
include all the different f-spaces discoverable by different factor-analytic 
procedures for the same number of factors. We cannot for instance 
argue that all such f-spaces must be contained within the space define 
by the first f principal components, although it includes a larger propor- 
tion of the total dispersion than any other f-space. For if the major 
axis of the dispersion passes through a point P at some distance from 
the origin, other points, say P' for instance, can be found in the near 
neighbourhood of P such that the vector OP’ projects onto any or every 
other principal axis. 

But it is not difficult to compare the variance between groups (Yv 
say) with the variance within groups, i.e. 4, along a principal axis of the 
standardized dispersion within groups. Thus taking each axis in turn we 
can discover whether the ratio y;: A tends to be greater when 4 is large 
or whether some other relationship exists or no obvious relationship at 
all. This subjects to scrutiny the postulate that the dimensions where 
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differences between groups are most significant are those where mem- 
bers of the same group vary most widely from one another. This postu- 
late underlies the use of factor analysis here, where it is supposed that 
neuroticism and I-E describe major dimensions of variation between 
individuals and also account for the incidence of different forms of 
mental disorder. And analogous postulates underly many other appli- 
cations of factor analysis as already explained in Chapter 6, section 3. 

The latent roots and vectors of R,, the matrix of correlations given in 
Table 8.2, are set out in full in Table 8.6.1 Each latent root, A, defines 
the variance along the vector /, which specifies one of the orthogonal 
axes of R,. Thus their total, £4 = m, defines the total variance of the 
Standardized dispersion within groups in all its dimensions. The ratio 
2/21 gives the proportion contributed by each latent root to this total. 
It is listed as a percentage in the centre column below, and serves to 
show how the latent roots diminish progressively in size. Another total 
is formed by adding the variance between groups along the axis /, i.e. 
the quantity here denoted v, to the variance within groups, viz: 
A + v» = V, Thus the ratio v/V; shows the proportion of the total 
Variance along each axis of R, due to the differences between group 
means in that dimension. I have preferred it to the ratio v,/À for 
descriptive purposes and list it, also as a percentage, in the right-hand 
column below. 


Ratio 

Component 1003/ Zà 100 yv,/Vi 
First 29-89 17:43** 
Second 16:334 1021** 
Third 11-62 4:06 
Fourth 9:47 10-64** 
Fifth 8-64 5-68 
Sixth 6:45 14-:36** 
Seventh 611 3-16 
Eighth 5-20 11-61** 
Ninth 3:32 18-97** 
Tenth 2:46 8-30* 

* Significant at the 5% level 
-" "a Se d 1 96 5 


1 

we pee point of view Re seems preferable to Rs for use here (cf. section 1, 
to UT-1." or definitions). Mathematically because itis directly related to T and tience 
normal y Descriptively because it gives the best estimate of the dispersion in the 

omal population. A sample composed of 15% hysterics, 6% anxiety states... 

fas monnaie cannot be regarded as representative of any population whatever. But 
represen no obvious objections to assuming initially that the 22 E-controls are a 
wit ioe sample of normal adult men; and since this group Is homogeneous 
Sheree in its internal variation, as shown by the s.d.'s given for each group 
able Hill On each test in Hildebrand's thesis, summation within groups is reason- 
mate Es R Rd was influenced by similar considerations; he attempted to «cd 

EP d excluding the supposedly extreme groups from his factor analysis. 
.P. InI.—U 
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TABLE 8.6 
Latent roots and vectors of the matrix of correlations in Table 8.2 


Principal components: 


First Second Third Fourth Fifth 

Roots 2:9894 1-6340 11617 0:9468 0-8639 
Variables Vectors (normalized) 

1. —00377  -—03264 0-6581  —0-0949 0:4255 

2. 0-4415 00015 — —02716 00318 — —0:1283 

3. | —02414  —00944 | —03227 0-5158 0:6393 

4. 0:3474 0:0227 —0:2659 —0:3831 0:3312 

5, 04198  —0:2150 0:0303 03729  —0:0107 

6. 0:3933 —0-0595 03215 —0-1644 0:2662 

7 0-4839 0-0852 —0-1131 —0:0137 0:1379 

8. | —0-1913 0-4682 —01113  —04791 0-3308 

9. 0-1129 0-5699 0-0915 0:3716 0:2319 

10. 0-1016 0:5319 0:4249 0078  —0:1755 

Sixth Seventh Eighth Ninth Tenth 

Roots 0-6450 0:6109 0:5204 0:3816 0:2463 
Variables Vectors (normalized) 

D 03108 — —0:3025 0.2060 = —0:1362 0:1270 

2 03941 — —0-0971 03818 — —05106  —0:3778 

3 —0-0346 0-2685 0-2692 0:0136 —0:0780 

4 —04784  —04061 03115 0-2276 0:0674 

5 02465 —0:1926 —0-2376 0:6334  —0:2848 

6. —0-3402 0:5083 —0:2698 —01812 —0:3923 

7 0:2902 0.3376  —0:0736 0-0131 0:7209 

8 0:4858 0.0836 — —01224 02488  —0:2696 

9 —0.1093  —04278 —03981  —03187 0:0491 

10. | —0:0905 0-1893 0-5796 02666  —00274 


Significant differences between the gzoup means seem on this evidence 
equally likely to occur in dimensions where individual variation about 
the means is large or small. The greatest occurs in the ninth component 
and the next greatest in the first. Others are scattered in between instea 
of clustering among the larger components as, according to the postu- 
late, they should. 

The finding is all the more remarkable because the postulate impugned 
seems otherwise so acceptable in the present context. The groups are 
segregated, in the opinion of the psychiatrists, by natural causes oper at- 
ing on constitutionally different individuals. The tests are of two kinds, 
chosen to replicate one another in measuring the two sorts of constitu- 
tional difference supposed to be most important diagnostically. These 
considerations would seem adequate to encourage the belief that the 
best sub-space to search for evidence of differences between the groups 
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is the one where individual variation is widest, i.e. the sub-space defined 
by the first two, or perhaps three components. But, as it now appears, 
a search started at the opposite end, where individual variation is 
smallest, might have proved equally rewarding. 

Knowing, as we do now, that the D-space is three dimensional, we 
may go on to enquire how much of the evidence of the variation in its 
three dimensions, 6,, ó,-and ôs, is provided by each of the principal 
components of R,. The correlations between the two sets of components 
show the relationship conveniently: 


Correlations of the principal components of the D-space 


with those 
of R; 1 2 3 
ds 0:544 0193  —0275 
as — 0-007 0:374 0-562 
As —0:153 0280 —0-159 
Ay 0016 —0680 —0-050 
2s 0:075 0-366 0-081 
Ag 0514 —0-049 —0-134 
A, 0.007 —0-291 0-029 
dg 0207  —0234 0-687 
Ag —0:597  —0047 —0-064 
Ayo —0:031 0058 | —0278 


e eae of variation in à, is to be found in A,, 4; and 2 to an almost 
oe Piet: fe mainly in 24, though À, and A; contribute to a lesser 
iste ies mainly in 2s, but 2, is also relevant. Thus to obtain a good 
n. 289. of the entire D-space all the principal axes of Ry marked ** 
Hu would need to be explored; and perhaps others. 
are eg if it is not safe to assume that the major dimensions of R, 
som eis Where search for discriminance is most likely to be rewarded, 
P E short of a complete analysis may still be sufficient to discover 
dienne about the canonical variates, and in particular, how many 
“ae significantly between the groups. Assuming only that some 
amount. may be found in any of the principal axes of R, a sufficient 
be reas Dn be accumulated from a limited number. And it will still 
rst to jen to consider the largest components first, as they are the 
With the i extracted. How many of the principal axes of R, beginning 
Prove th argest, have to be investigated to obtain sufficient evidence to 
pe at this D-space is 3-dimensional? 

insu orien restricted to the 3-space of A,, 2, and As turned out to be 
cient. But a peculiarly instructive picture is obtained. The three 


canoni ; A f : 
Donical variates have the latent roots, for which I will retain the same 
Notation, j 


9 = 0:2163, hence y? = 34-61 with d.f. 8** 
ôs = 0:1133, „ 5 —1812 , » 6* 
ôs = 0:0375 „ „= 600, » 4 
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The normalized vectors of the first two are 


8, 35 
a 0:9840 0:0706 
A. —0-0685 09975 
ps —0-1645 0:0074 


Thus the latent vector of 6, coincides almost exactly with that of Ay 
and ô, with A,. 

That is to say, the D-space appears only in a 2-dimensional cross- 
section; the first canonical variate coincides almost exactly with the 
major axis of the dispersion of individuals about their group means; 
and the second variate almost exactly with the next principal axis of 
that dispersion. The expectations deduced from the hypothesis this €x- 
periment was designed to test are supported admirably in every respect 


FIG. 8.5. The Scatter of the Groups in the Plane defined by the first two Principal 


Components of the Dispersion within Groups. 
by the evidence shown in this picture, ef. Fig. 8.5, though they are not 
sustained by the evidence as a whole. The projections of the group 
means on 4, and 2, are given below: 


Group ay as 
Anxieties 5-9102 0-9100 
Obsessions 5-8247 0:3839 
Depressions 4:7539 0:6976 
Hysterias 4:9635 1:8174 
Psychopathics 6:0831 1:8432 
Mixed neuroses 5:1883 1:0929 
E-controls 3-4712 1:2602 
All cases 5:1557 1-2198 


The third dimension of the dispersion between groups first appear 
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above our horizon when we start exploring the 4-space of A, to 24. The 
latent roots of the canonical variates are augmented to 

ô, = 0:2189, hence 7? = 35:028 with 9 d.f. ** 

dp = 0.1886, a 5, = 307169 , 7, ** 

65 = 00750; 45 a = 11-993 , 5, * 

ô, = 000388, ,, , = 0603 , 3, 


The normalized vectors for the first three are 


ð, ðs [^ 
Ay 0:9449 0-1892 —0-0102 
Ay 0:0736 —0-6240 0:7189 
ds —0-0916 —0-2820 —0:5556 
s —0-2254 0-7038 0:4176 


The vector for à, does not yet appear to diverge far from that of Àj, 
but those for à, and ôs no longer tend to converge with any of the 
Principal axes of R,. 

One conclusion can be drawn confidently from this investigation of 
the Projection of a D-space on an f-space, where f < m; it is this: if the 
Projection is p-dimensional the D-space must be p dimensional at least. 
Canonical analysis cannot be used restrictively as Spearman and the 
early factorists used factor analysis: p is a minimum, not a maximum. 


6. STORMS’ TREATMENT OF THE DATA 


Very little theoretical justification has been found here for expecting the 
vectors defined by a factor analysis to coincide with those defined by a 
discriminatory analysis of the same data; and the evidence examined 
fas shown that they do not converge closely even under particularly 
favourable conditions, The reason is not because one method of analysis 
: better Or worse than the other, but because the two methods of 
analysis are designed to serve different purposes. MM 
b And for the same reason it is arguable that the guiding principles to 
be followed in assembling a set of tests for a factor analysis are not the 
ther Ones for an experiment in discrimination. For a factor analysis 
ob SN With prior expectations that one psychological trait, Fl, is 
Pei e In a certain range of test situations, another trait € 
and €r range, etc., enough tests are required to cover the range rr 2 
ass enough more to cover the range for F2; and in the set finally 
fe embled there will be a large number of tests clustering together in a 
W distinct categories. 
in = a discriminatory analysis, on the other hand, the tests chosen for 
S sion in the battery should serve either as predictors or suppressors. 
Predictor is any measurement which is found to correlate positively or 
&tively with the criterion. And the predictors assembled in a battery 
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should be as far as possible uncorrelated with one another. For if they 
are their evidence is cumulative, but if on the contrary they are corre- 
lated what can be inferred from one may add nothing to what is already 
known from another. A suppressor is one which correlates positively or 
negatively with a predictor but not with the criterion. Its use is indirect 
and perhaps best explained with the aid of an example: To measure 
verbal ability independently of intelligence is difficult with one test 
alone, for tests of verbal ability usually reflect intelligence to some ex- 
tent. But tests of intelligence which do not involve verbal ability are 
available; and a weighted summation of two tests, one of verbal and 


TABLE 8.7 
Data used by Storms 
PART 1. Means of groups (see footnote) on tests: Parr 2. S.d.'s within 
Suggestibility Leg groups of tests 
Group: Reversals Persistence enumerated below: 
Anxiety states 3:3656 1:6234 1. 120-8664 
Obsessions 3-7611 1-6443 2. 77161 
Depressions 3:0700 1-5706 3. 2:4372 
Hysterics 4:6244 1:3046 4. 0:1113 
Psychopathies 3:0368 1-6818 5. 4717987 
Mixed neuroses 3:3726 1-6029 6. 36:7880 
PART 3. Correlations within groups, Re. Tests 4, 5 and 6 have been reversed, i.e. scored 
negatively: 
2 3 4 5 6 
1. R Scale 0-02626 0-03460 0-00143  0-26072 0-09598 
2. Lie scale —0-17471 0-15891  0-29381 0-22378 
3. Suggestibility 
reversals 017490 0.01383  0:12029 
4. Leg persistence 0.10033 — —0:06931 
5. MMQ 0:46466 
6. Annoyances 
Part 4. L-!, when LL’ = R;: a 
1 2 3 4 5 6 


1. 1-00000 

2. —0-02627 1-00034 

3. —0:03986  0-17863 1-01643 

4. 0:01135 —0-20279 —0-21696 1:03569 

5. 0:27361 —0:31362 —0-05165 —0:04992 1:08817 

6. 0.03387 —0-17665 —0-19981 — 0-19591 —0-51718  1:16937 


Note: Group means on other tests are to be found in Table 8.3. 


one of non-verbal intelligence may provide the required measurement. 


The non-verbal test is used as a suppressor, and scores on it receive a 
negative weight. In general we may say that suppressors control com- 
onents of the total variance of the predictors which would otherwise 
contribute to the error variance of the estimate of the criterion. An 
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again, there are no advantages in including a large number of sup- 
pressors which replicate one another. In the set finally assembled for the 
discriminatory analysis there will be a fairly small number of tests which 
do not tend to form clusters, except where suppressors are linked with 
predictors. 

Storms was guided by considerations of this kind in selecting a bat- 
tery from Hildebrand’s tests for his exercise in discriminatory analysis. 
He limited the measures of neuroticism and I-E to four: the Maudsley 
Medical questionnaire and lie scale, Annoyances and the R scale from 
the Guilford-Martin inventory. To these he added two tests which re- 
vealed Significant differences between neurotic groups but had negli- 
gible factor loadings on neuroticism and I-E in Hildebrand’s analysis 
(cf. Table 8.1). They were the Suggestibility Reversals and Leg Per- 
Sistence tests. A square-root transformation was used for the former 
anda logarithmic transformation for the latter, as the crude scores on 
both tests have very skew distributions. 

The data relate to 145 cases only, as these two tests were not given to 
the E-controls and the H-controls were regarded as suspect. The essen- 
tial particulars are given in Table 8.7, and the triangular matrix L~? is 
Included for the convenience of anyone interested in reconstructing the 
complete analysis. 

Although only six tests are included, the battery is distinctly more 
Sensitive to diagnostic differences than the ten-test battery discussed in 
the Preceding sections (2 to 5), which was originally assembled with 
factor analysis in mind. Some details of the discriminant functions are 
Blven by Storms. The summary below can be compared with the sum- 
mary for the ten-test battery in section 3. 


Obs. Dep. Hys. Psy. Mix. 
Anxiety states -— € 44 ++ = 
Obsessions hé 4 - = 
Depressions A EN deae = 
Hysterias ++ ++ 
Psychopathies + 


ixed neuroses 


Note: +, discrimination significant at the 5% level 
+t, » » ww I% i 


iene, means for the suggestibility reversals and leg persistence tests 
ant In part I of Table 8.7 reveal conspicuous differences between 
iso] hysterics and the other patients, and the discriminant eripi 
inu them effectively. The isolation of the psychopathic group Ji ps 
Psych, Sed; in consequence the difference between the hysterics c = 
ne Opaths is the widest found in any of the comparisons. Bu 
s ae States, mixed neuroses, obsessions and depressions remain co 
Cd together, so there is still scope for further refinements. 
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The first three latent roots from 
ô, = 0:2674, hence 7? 


the canonical analysis are 
= 37.169 with 10 d.f. ** 


ee 


ôs = 0:2357, 32-760 8 
63 = 0:0595, 8-269 6 


The third is insignificant. There is in fact no variate to correspond with 
the first obtained from the ten-test battery, waich contrasted the nor- 
mals with the neurotics but showed no differences between the neurotic 
groups. The disappearance of this vector is not surprising, seeing that 
no control group has been included. The first and second vectors from 
the six-test battery are to be compared with the second and third from 


» ga ce » » 


» p s » » 


the ten-test battery. 


The vectors of the two significant canonical variates are 


Test 
. R scale 
. Lie scale 
. Suggestibility reversals 
. Leg persistence 
MMQ 
Annoyances 


OV RUMNC 


Note: The signs given are appropriate for use with reversed scores on tests 4, 5 and 


ô, 9: 


0:3349 — 0:6756 
0:3953 0:4033 
0:6355 0:0779 
0:5429 0:1274 
—0:0539 —0-1800 
0:1733 0:5711 


6. 


The distribution of the group means in the two dimensions is shown in 


Fig. 8.6. 


There is a striking resemblance between the 6, vector above and the 


vector of the second canonical variate from the ten-test batter 


y. The 


distributions of the group means along the two axes are almost iden- 


tical. This can be seen by a comparison of Figs. 8.4 an 
dimensions the psychopathies lie at the opposite end 


d 8.6: in both 
to the depressions 


and the anxiety states, while the remaining groups cluster around the 


general mean. 


This leaves ô, above presumably comparable to some exte 
in the ten-test battery: since the 6-space and the 10-space 
dimensions in common, where the points locating individua 
tical, the alternative hypothesis that all resemblances are pure y, 
cidental seems rather far-fetched. The introduction of the suggestib 


reversals and leg persistence tests 


nt with ós 
have four 
Is are iden- 
ly coin- 

ility 
has extended the variance 10 this 


dimension (it has the smallest of the three significant latent roots in the 
ten-test battery and the larger of the two in the six-test) by widening the 


distance between the hysterics and the remaining groups. As @ I€ 
is less easy to find any vector with properties such as are ascr 
I-E in the plane exhibited in Fig. 8.6 than it is in 
ô, in Fig. 8.6 simply provides a dichotomy separating the 


the other groups. 


sult 1 
ibed to 
Fig. 8.4. The axis 0 
hysterics fro: 
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This is not a summary of Storms’ paper. Much of the evidence he 
quotes is taken from the analysis of the ten-test battery reported in 
previous sections. And the evidence I have given here relating to his 
Six-test battery does not appear in his paper. Naturally I hope that 
my presentation makes clearer where and how the differences between 
s results of the factor analysis and the canonical variate analysis have 

risen. 

Storms uses this evidence for what might almost be described as an 
attack on Eysenck, which is not quite justified in my opinion. Eysenck 
can hardly be considered personally to blame because factor analysis and 


olys. 


HG. 8.6. The 2-dimensional Dispersion between Groups found in the 6-Test 
Battery. 


fucriminatory analysis serve different purposes, benefit most from dif- 
the les. Rem experiments, and lead to different results. He is none 
ogical t Tee to postulate that introversion-extraversion 1s à psycho- 
Method Es relevant to problems of psychiatric diagnosis, to look for a 
ed obj of measuring it by the weighted summation of scores on spect- 
am RAMS tests, and to employ factor analysis to find suitable weights. 
ver ure that many eminent psychologists would concur that this is 
sj ME line of research. 
at risk orms is fully supported by the evidence when he emphasizes 
orm a $ may be incurred if hysterics are pooled with psychopaths to 
&roup xu of neurotic extraverts for experimental purposes. Such a 
Other re ght be homogeneous with regard to I-E but heterogeneous In 
Under Spects; and without knowing a good deal about the experiment 
Consideration it would be difficult to say whether the risks are 
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likely to be serious or negligible in the particular context. Similar risks 
may also arise if cases of anxiety states and reactive depressions are 
pooled to form a group of dysthymics or neurotic introverts. Although 
Hildebrand’s evidence is not sufficient to demonstrate any significant 
difference between these diagnostic categories the distance between their 
group means and their locations on the axes in Fig. 8.4 are enough to 
suggest that further research may lead to effective discrimination with 
objective tests. Other evidence of differences in test results (cf. Foulds, 
1951 and 1953) can be cited in corroboration. 


7. SUMMARY 


Canonical analysis is applicable to the measurements obtained by giving 
a number of psychological tests to individuals belonging to several 
groups. The means of the g groups form a scatter of g points in the 
m-space defined by the m tests, which is the dispersion between groups. 
The individuals' scores form scatters about their group means which can 
be collected to form a dispersion within groups. When the two disper- 
sions are contracted simultaneously to the extent necessary to normalize 
the variance within groups in all dimensions, the between-group dis- 
persion assumesan ellipsoidal form in a D-space which may be m-dimen- 
sional or less. Its major axis is the dimension where the most significant 
differences between the groups are to be found, the next the dimension 
where differences independent of the first are most significant, etc. he 
canonical analysis of discriminance is a principal component analysis 
applied to this contracted dispersion between groups to define its axes 
and test the significance of the variation there. Working methods are 
described and two examples are discussed. , 
The axes or dimensions so defined do not necessarily coinci d 
those found by factor analysis of the same data. There are no a pr cn 
reasons for expecting the most significant differences to appear in the 
dimensions where the variance of the individuals about their group 
means (or about the general mean for all groups) is widest. In an exper 
ment where the conditions were particularly favourable for the appear 
ance of a relationship, no evidence of any relationship was foun be 
tween the amount of variation within groups in a given dimension an 
the significance of the difference between groups there. z 
If factor scores are calculated for f factors an f-dimensional sub-spac 
is defined within the m-space, and a canonical analysis can be made is 
the projection of the dispersion between groups to be found there. T m 
will give some incomplete information about the principal axes oft E 
original dispersion in the D-space. As fincreases towards m the inform 
tion becomes more complete. The evidence examined here does ven 
suggest that there are any advantages in considering the f-space wi a 
the dispersion within groups is widest except that it is the most C? 


de with 
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venient one to define first. But there may be special reasons for suppos- 
ing that there are particular advantages in some psychological experi- 
ments, which have not been considered here. 

The principles to be followed in selecting a battery of tests for a 
canonical analysis are not the same as for a factor analysis. 


8. BIBLIOGRAPHY, REFERENCES AND ACKNOWLEDGEMENTS 


Canonical variate analysis, the term which has come into use among 
Psychologists for the kind of analysis described here, has too wide a 
generality to be quite appropriate, Canonical analysis of discriminance 
1S my own suggestion: on the one hand the problem of discrimination 
15 what determines the form the canonical analysis takes, and on the 
other canonical analysis is only one of the ways of treating the problem 
of discrimination. Together, the two terms ‘canonical’ and 'discrimi- 
nance’ seem to specify this kind of analysis precisely. 

Rao, whose presentation of the subject is authoritative (1952), dis- 
cusses it under the headings of classificatory problems and the general- 
zed distance between groups. Kendall deals with it in his chapter on 
discriminatory analysis (1957) as the case of k populations. Both 
authors make use of the example published by Rao and Slater (1949) 
Which Seems to have become the standard one. Kendall's account of 
the historical development of methods for analysing multidimensional 
t Tiations between groups, and his systematic treatment of problems of 
esting significance will, I believe, be found most helpful. He shows the 
ose a Importance of the original contributions of Mahalanobis 
mathe 1948) and Wilks (1932). Matrix algebra is usually introduced into 
Row ei courses at the advanced level, but as it is a development 
it & ethods of solving simultaneous equations the elementary operations 
na can be explained to anyone who has encountered such 
ittle € in algebra. Aitken explains them in this way in his excellent 
ma ia (1939). For wider information in this field Lanczos (1957) 
"sd € recommended. Mr. J. G. Hayes suggested using the triangular 
details L to simplify computations (cf. section 3, Chapter 7). Further 

S of this working method are given by Fox (1950). 
Y thanks are due to Professor H. J. Eysenck for the facilities and 
uragement he has given me, and to Miss N. Hemsley who was of 
Breat assistance with the computations. 
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THE PLACE OF THEORY IN 
PSYCHOLOGY 


H. J. Eysenck 


PSYCHOLOGISTS hold quite widely differing views about the useful- 
ness and the general role of theory in psychology. Some stress simple 
inductive methods exclusively, maintaining that the time is not yet— 
Or perhaps may never be!—when more ambitious generalizations of a 
theoretical kind would be appropriate. Others emphasize the similarities 
between all sciences, and conclude that ‘what is good for physics must 
be good for psychology! In all the discussion that has taken place, 
sight is often lost of a very simple fact which, to my mind, is quite 
crucial. Scientific theories differ among themselves to such an extent 
that any discussion about the place of ‘theory’ in science should really 
be restyled: a discussion about the place of theories in science. In par- 
ticular, there is a continuum ranging all the way from weak to strong 
theories, and failure to pay attention to the distinction between these 
two kinds of theories renders much discussion, and many criticisms, 
quite meaningless. 

E theory which is always quoted as the perfect example of the ap- 
fiato of the hypothetico-deductive method is of course Newton's 
e Ory of gravitation. This is a good example of a strong teon, x 
m ence, and it may be worth while to look at those features of it whic : 
Pica it so. In the first place, it is based on a very large eoe » 
In urate observations, made over many years by large groups o! per 2 

the second place, it brings together a number of sub-fields in whic 
peantitative laws—such as those of Kepler and Galileo—had already 

Sen discovered and verified. In the third place, the phenomena in 
Question were relatively clear-cut and unambiguous; in particular, they 


H. J. Eysenck 


were not embedded in or entangled with groups of other phenomena. 
In the fourth place, the mathematical relations in question were not of 
a very complex order—compared, say, with modern atomic theory. In 
the fifth place, and largely as a consequence of the preceding points, 
predictions were uncommonly straightforward and precise; verification 
and infirmation of deductions did not give rise to special problems. 

While for all these reasons Newton’s theory is a good example for 
the beginner, it is seriously misleading for the practical scientist. Very 
few scientific theories are in fact strong theories of this kind; most of 
them lie toward the opposite end of a continuum going from strong to 
weak. The typical weak theory in science shows all the opposite charac- 
teristics to those mentioned in connection with Newton’s. Only few 
observations, and these of doubtful accuracy, are available. Few quan- 
titative or even qualitative laws, universally established, are available 
in sub-fields. The nature of the phenomena in question is by no means 
clear-cut or well understood. Mathematical relations are often very com- 
plex, and predictions are neither straightforward nor precise. What, it 
may be asked, is the use of such a weak theory? 

The answer may be given in the words of the famous physicist J. J. 
Thomson: ‘A theory in science is a policy rather than a creed.’ I quoted 
these words on the front page of my book Dynamics of Anxiety and 
Hysteria to indicate the heuristic nature of the theory there developed, 
and I believe that they contain the key to a proper understanding of the 
function of theory in psychology—where nearly all theories are weal 
theories almost by definition. The value of a weak theory, to put it 
briefly, lies in the fact that it directs attention to those problems which 
most repay study from a systematic point of view; in Thomson's words, it 
defines a policy of action and research. It is by giving rise to worth- 
while research, rather than by necessarily being right, that a weak 
theory makes its greatest contribution to science. 

An example or two may make clearer what I have in mind. On the 
basis of the theory developed in Dynamics of Anxiety and Hysteria, 
predicted that extraverted people would have greater reminiscence 
effects after massed practice than would introverted people. Using the 
pursuit rotor, several investigators have tested this deduction. All those 
using a practice period of five minutes have verified the deduction; none 
of those using a practice period of ninety seconds have done so. This 
illustrates two points. In the first place, our knowledge of the sub-fie 
of reminiscence was too restricted to allow of a more precise specifica- 
tion of differences between extraverts and introverts; clearly length o 
practice should have been specified as an important parameter, but 
could not be so specified because of lack of knowledge in this respect. 
In the second place, the partial verification of the deduction is of c0?" 
siderable interest; it suggests relationships between personality an 
learning theory which are worthy of closer study. At the same time, the 
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direction which such study should take are emphasized by the findings; 
we clearly must concentrate on time relations in the practice period, and 
presumably also in the rest period. In other words, the theory leads us 
to a more precise study of the growth and decay of reactive inhibition, 
which is supposed to underlie reminiscence; the fact that positive rela- 
tions with extraversion have been firmly established for certain time 
intervals suggests that a-policy of research concentrating on this aspect 
will not be a waste of time. 

Tran another example, let us take two deductions from the general 
theory: 


(1) Extraverts will show greater visual satiation; — | : 
(2) Extraverts will have greater difficulties in maintaining visual 


fixation. 


At first sight, these two deductions may appear contradictory. Satia- 
tion can only be measured by the determination of figural after-effects 
following visual fixation of an inspection figure; failure to maintain 
fixation leads to lower figural after-effects. According to (1), therefore, 
extraverts should show greater figural after-effects, while according to 
(2) they should show lesser effects. Any outcome, including equality, 
Could therefore apparently be ‘explained’ in terms of these two deduc- 
tions. This, however, is not quite true. Fixation should not begin to 
deteriorate until some considerable time had elapsed, say 30 seconds; 
satiation should be almost immediate. A combined prediction would 
therefore be that for short inspection times extraverts would have greater 
figural after-effects, while for long inspection times their after-effect 
Scores would be lower. According to this hypothesis, there should also 
be one point at which extraverts and introverts should be equal, namely 
at some intermediate length of inspection time. Again no precise pre- 
diction can be made as to the times involved because our knowledge of 
figural after-effects is not at the moment extensive or quantitative 
€nough for such predictions. Butagain a weak theory has directed atten- 
tion to an important area of research, and the positive results reported 
to date with different types of figural after-effect tests suggests that a 
Policy of research concentrating on this complex of relationships would 
Dot necessarily fail to repay the investigator for his effort. — ; 
hese considerations suggest that it is relatively absurd, particularly 
at an early stage of development of a scientific theory, to construct a 
Score board, with each success and each failure of prediction d 
to give a kind of batting average. There are usually many reasons for E 
apparent) failure of a prediction, but far fewer for its puni = E 
quently failures are of much less interest than successes In eva p scd 
theory at an early stage. In particular, failures may arise, not : e 
the theory is in error, but because the deduction made in à pieds is 
Sub-field makes use of a theoretical model in that sub-field whic 
E.P. 1I—x 305 


H. J. Eysenck 


incorrect; this does not in any way invalidate the general theory. As 
an example, we may take the prediction that extraverts would show a 
more pronounced bowing in the serial position curve effect in nonsense 
syllable learning. According to the Hull-Lepley theory, this effect is due 
to inhibition of delay, and the hypothesis that extraverts are more prone 
than introverts to generate inhibitory potential mediates a clear-cut 
prediction. This prediction could not be verified, however, and neither 
could another one linking depressant drugs with an increased bowing 
effect. A special experiment was therefore carried out to test the Hull- 
Lepley theory, by comparing bowing effects with and without intervals 
between successive presentations of the nonsense syllable series; the 
differences which should have appeared according to the Hull-Lepley 
theory failed to materialize, and consequently it was concluded that the 
Hull-Lepley theory was itself in error (Eysenck, 1959c). Failure to verify 
the deduction from the general personality theory was due, therefore, 
not to an error in the theory itself, but to an error in that part of learn- 
ing theory used to mediate a particular prediction. . 

Even when there is no such error, predictions may not be verifiable 
for a variety of reasons although the theory is in fact correct. Two 
well-known examples are the failure to observe parallax in stellar post" 
tions, which was one of the most direct predictions made from Coper- 
nicus's heliocentric theory of the planetary system, and the failure to 
discover the capillaries which according to Harvey's theory of the circu- 
lation of the blood should intervene between arteries and veins. The 
(correct) explanations given by Copernicus and Harvey, namely that 
the stars were too far away, and the capillaries too small, to make ob- 
servation of the predicted effect possible with available instruments, 
were not at the time susceptible to proof. Scientists tended to accept 
the theories in question because they unified a large number of facts, 
although these apparently crucial deductions remained unverified for 
a long time. 

Altogether, the notion of an experimentum crucis to decide the cor- 
rectness of a theory, or to decide between alternative theories, is OC 
which appears more frequently in the pages of popular expositions o 
scientific method than in actual practice. It will be remembered that tWO 
members of the Thomson family, father and son, were both awarde 
the Nobel prize in physics, one for showing conclusively that light wei 
of the nature of a particle, the other that it was of the nature of a wave: 
Having performed crucial experiments to prove both of these ar 
tive theories regarding the nature of light, physicists are still left wit 
something which sometimes behaves like a wave, sometimes like a par” 
ticle; they have also learned rather painfully that crucial experiments 
are seldom as crucial as they are supposed to be—even in connection 
with a strong theory! . ; — 

We may put the whole am i. did differently, by following 
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discussion given by Cohen and Nagel (1936). They take as their example 
Foucault’s famous experiment in which he showed that light travels 
faster in air than in water. This was considered a crucial experiment to 
decide between two hypotheses: H,, the hypothesis that light consists 
of very small particles travelling with enormous speeds, and Hg, the 
hypothesis that light is a form of wave motion. H, implies the proposi- 
tion P, that the velocity.of light in water is greater than in air, while Hy 
implies the proposition P, that the velocity of light in water is less than 
in air. *According to the doctrine of crucial experiments, the corpuscular 
hypothesis of light should have been banished to limbo once and for 
all? However, as is well known, contemporary physics has revived the 
corpuscular theory in order to explain certain optical effects whiah 
cannot be explained by the wave theory. What went wrong? " 

As Cohen and Nagel point out, ‘in order to deduce the proposition 
P, from Hy, and in order that we may be able to perform the experiment 
of Foucault, many other assumptions, K, must be made about the 
nature of light and the instruments we employ in measuring its velocity. 

Consequently, it is not the hypothesis H, alone which is being put to 
the test by the experiment—it is H, and K.' The logic of the crucial 
experiment may therefore be put in this fashion. If H, and K, then Pj; 
if now experiment shows P, to be false, then either H; is false or K (in 
Part or complete) is false. 'If we have good grounds for believing that 
K is not false, H; is refuted by the experiment. Nevertheless the experi- 
ment really tests both H, and K. If in the interest of the coherence of 
Our knowledge it is found necessary to revise the assumptions con- 
tained in K, the crucial experiment must be reinterpreted, and it need 
not then decide against Hj.’ - 

, We may now indicate the relevance of this discussion to our distinc- 
tion between weak and strong theories. Strong theories are elaborated 
On the basis of a large, well founded and experimentally based set of 
assumptions, K, so that the results of new experiments are interpreted 
almost exclusively in terms of the light they throw on Hi, H: Ay 

eak theories lack such a basis, and results of new experiments may 

e interpreted with almost equal case as disproving H as disproving K 

See the example given above of serial position learning effects). The 
relative importance of K can of course vary continuously, giving rise 
toa Continuum; the use of the terms ‘strong’ and ‘weak’ is merely in- 
tended to refer to the extremes of this continuum, not to suggest the 
existence of two quite separate types of theories. In psychology, K is 
infinitely less strong than it is in physics, and consequently theories 1n 
Play inevitably lie towards the weaker pole. — 
of eak theories in science, then, generate research the reo ^ 

Which is to investigate certain problems which, but for the theory » 
question, would not have arisen in that particular form; emis n 
Purpose is not to generate predictions the main use of which is 
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direct verification or infirmation of the theory. This is not to say that 
such theories are not weakened if the majority of predictions made are 
infirmed; obviously there comes a point when investigators turn to 
more promising theories after consistent failure with a given hypothesis, 
however interesting it may be. My intention is merely to draw attention 
to the fact—which will surely be obvious to most scientifically-trained 
people—that both proof and failure of deductions from a scientific 
hypothesis are more complex than may appear at first sight, and that 
the simple-minded application of precepts derived from strong theories 
to a field like psychology (or modern sub-atomic physics) may be 
extremely misleading. Ultimately, as Conant has emphasized, scientific 
theories of any kind are not discarded because of failures of prediction, 
but only because a better theory has been advanced; this may account 
for the longevity of Hull's system in spite of the many onslaughts on it 
in recent years. 

There is a further characteristic of weak theories, as contrasted with 
strong, which deserves mention. In strong theories the different postu- 
lates are interdependent; it is not possible to change one without chang- 
ing the rest, and indeed throwing overboard the whole theory. In weak 
theories, such interdependence is much less marked, and changes 1n 
one part of the theory are quite permissible without the necessity O 
altering other parts as well. Thus Hull’s theory of inhibition is periphera 
and ‘work’-oriented; I have preferred (for various experimental reasons 
to work with a central theory of inhibition rather more akin to Pavlov s- 
This substitution, as well as many others, can be made without extend- 
ing the framework of Hull’s theory unduly; nothing of this kind woul 
have been possible with Newton’s theory of gravitation. Weak theories 
are very flexible; that is why they are such good guides for research; 
strong theories have an air of ‘take it or leave it’ which makes them 
superior as guides to action, but also less likely to lead to important new 
discoveries. . 

There is, of course, one point which all scientific theories have 1n 
common, and which decisively sets them off from non-scientific theories. 
However weak a theory may be, it must generate predictions W lc 
admit of experimental investigation. In other words, the theory must E 
reality-oriented, and the manipulations of reality implicit in its testing 
must be capable of being made explicit without ambiguity. The hypo- 
thesis that planetary motions can be explained in terms of angels pushing 
the planets around on their courses, preordained by God, is not à 
scientific theory because it suggests no direct experimental investigation. 
It is an interesting question whether such concepts as the Oedipus 
complex, or the super ego, or the archetype, have characteristics whic 
make them suitable for use in scientific theories, however weak, OT 
whether they are outside science altogether; it would take us too far to 
discuss this point in detail. 
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Some scientific concepts, of course, have properties which are very 
difficult to disentangle from the matrix of events, even in the laboratory. 
Take for example the notion of the neutrino, first suggested by Pauli in 
1931 in order to explain certain facts of radioactivity; this particle not 
only has no electric charge, but also has only negligible mass when at 
rest. All the properties assigned to it were those needed to balance cer- 
tain physical equations, and but for the development of atomic energy 
lts actual experimental demonstration would have remained impos- 
sible. As W. F. G. Swann said, ‘it has no act’, i.e. there is nothing (or at 
least very little) in its performance which could be directly tested. Never- 
theless, the neutrino has a very slight probability of interacting with any 
proton which it might encounter (one chance in 10** roughly), and this 
has been used to demonstrate its existence. Hull's ‘involuntary rest 
pauses' may give us just as much trouble as did the neutrino to physicists, 
and for the same reasons—they don't seem to have much of an act! 

One of the main consequences of having to deal with weak rather 
than with strong theories is that attention should shift from considera- 
tions of right and wrong to considerations of fruitful or useless. It is 
difficult enough to disprove a strong theory; it is almost impossible to 
disprove a weak one. The point has been well made by Roby (1959): 
Ultimately, theory must answer to the facts, but this is not the only 
Tequirement placed on a theoretical system. Logical consistency, eco- 
nomy of assumptions, and even a degree of elegance are by no means 
Secondary factors in determining the overall staying power of a theory. 
These patrician qualities are quite unlikely to mature, however, if the 
demand for direct descriptive capability is too insistent. It is not... 
contended that theory construction should be totally unresponsible to 
the general body of knowledge about behaviour. Rather it is held that 
Point-by-point testing of isolated facets of a theory against specific 
behavioural phenomena or experimental findings is at odds with the 
Whole purpose of theoretical abstraction. Suggestive hypotheses should 
not be put directly to drudgery but should be entertained for a while, 
as rare and welcome guests. ` 

Weak theories usually imply the absence of precise, trustworthy data, 
and it is interesting that in the history of science most strong theories 
Started out, in fact, as weak theories; their very existence stimulated the 
accumulation of precise data which later transformed these theories into 
Strong ones, The heliocentric theory of Copernicus was based on 
Wretchedly poor and inaccurate data; indeed, available observations 
Were so erroneous and unreliable that they could not be used to arbitrate 

tween the Copernican and the Ptolemaic theories. It was a weak 
cory in every respect, but by its very existence it encouraged astrono- 
ie to seek for ever more accurate observations, until Tycho Brahe 
ally gathered data reliable enough for Kepler to verify his three laws 

Planetary motion ; these in turn mediated Newton’s great synthesis. 
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This would appear to be the answer to those who feel that the publica- 
tion of weak theories is premature, and should await more precise 
measurement and other advances in knowledge: such precise measure- 
ment, and such other advances, are usually the consequence of the 
interest aroused by a challenging new theory, however weak. Without 
the existence of the theory, there would be littie motivation for the more 
precise measurements to be made. Every strorg theory started out as a 
weak one (but of course not every weak theory necessarily becomes à 
strong one!); premature demands on such a theory for accuracy and 
rigour appropriate to strong theories are less likely to lead to advances 
than a realistic insight into the limitations of weak theories. , 
Roby goes on to point out that absolutist notions of validity in 
scientific theory find no counterpart in physics. He distinguishes three 
major types of validity. ‘First, a theory may be valid in a subjunctive 
sense. That is, it may describe the behaviour of entities under condi- 
tions that never obtain in fact. The Galilean law of falling bodies and 
the ideal gas laws are examples. Second, a theory may be locally true, 
that is, may hold over certain ranges of the relevant variables. Hooke's 
law of elasticity is a clear-cut example, and the Newtonian laws © 
motion (holding for ‘‘middle-sized” phenomena) are now accepted in 
this sense. Finally, a law may hold statistically, being supported by 
large numbers of observations although there are local exceptions. The 
standard example is the law of increasing entropy. In each case; the 
postulated “law” can be definied as the potential limit of observational 
approximations as certain stipulations are satisfied.’ Roby goes on to 
point out that ‘it may well be that behavioural laws must invoke al 
three of these forms of licence in relating predictions to the world o 
observation. In other words, we may be reduced to predicting relation- 
ships among parameters of statistical distributions which can be em- 
pirically sampled within certain limits only, and which are subject to 
extraneous disturbing factors." ] 
This discussion has been rather general, and it may be of some inte 
to relate it more directly to the kind of problem raised by the set 0 
theories tested in this book. We may begin by considering the order o 
size of expected correlations, bearing in mind the ‘extraneous disturbing 
factors’. According to our theory, different measures of conditioning 
should intercorrelate positively because speed of formation of excitatory 
and inhibitory potential, which are fundamental to speed of condition- 
ing, are supposed to be permanent properties of individual organisms: 
It might seem at first sight that a correlation of unity should be foun 
between any two measures of conditioning. This, however, is not 50- 
In the first place, we have the fundamental problem of deciding whether 
a given procedure is in fact a measure of conditioning or not. Learning 
theory (Cohen and Nagel’s ‘K’) does not give us a clear answer, an 
the chapter by Willett in this book shows clearly the difficulties which 
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are placed in the way of deciding whether a given procedure is in fact 
a measure of conditioning, or whether the observed changes are simple 
artefacts, or what precisely is taking place. In the second place, the 
classical procedures, even if we could be certain that they may justifi- 
ably be regarded as measures of conditioning, do not show very high 
reliabilities; figures of 0°6 or thereabouts may perhaps be regarded as 
typical. This would lead us to expect correlations between two such 
measures to be in the neighbourhood of 0:6 x 0:6 — 0:36, which is a 
much lower value than unity. 
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A, = PGR Conditioning A, = Introversion 
B, = Eye-Blink Conditioning B, = B, 
1 = Number of Sweat Glands C, = Environmental Factors 


D, = D, = Other Extraneous Factors E, = E; = Sensitivity of Cornea 
F, = F, = Other Extraneous Factors 


But this is not all. Let A and B in the figure stand for the total non- 
chance variance of PGR conditioning and eye-blink conditioning scores 
respectively of a group of subjects. The overlap between these circles 
Goons the existence of a correlation (corrected for attenuation) of 
in 07. The correlation fails to be unity because both sets of scores are 
A „Part determined by extraneous causes which are irrelevant to the 

iginal hypothesis, and which are specific to each set of measurements. 
the examples, I have given the number of sweat glands in the palm (for 

* PGR), and the sensitivity of the cornea (for eye-blink conditioning). 

and H Schematically indicate the presence of other, unidentified fac- 
is Which in part determine score on these two tests of ‘conditionability’. 
then addition to the common, underlying factor of become 
ha We also have forces making for specificity in score on the ve 

F of conditioning test. These forces may be so strong as to reduce 
be communality to a very small figure; sometimes of course they may 

Mite negligible, Without knowledge of all these irrelevant extraneous 
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causes (which should form part of ‘K’) it is difficult to make any pre- 
cise prediction, or to interpret experimental results, particularly when 
these are indicative of only small or virtually non-existent relationships. 
It should of course be possible to measure separately the causes shown 
as C and E, and to treat them as suppressor variables in an equation; 
this would correct the observed correlation fér attenuation due to this 
particular extraneous variable. Alternatively, it should be possible to 
rule out some of these variables experimentally; thus by adjusting the 
force of the UCS (the puff of air) in the eye-blink conditioning experi- 
ment in terms of individual thresholds and j.n.d. values for different 
subjects, we should be able to control this particular disturbing influ- 
ence. (This obvious precaution has not been taken in the published 
work of Spence, Taylor, or Franks, all of whom have been concerned with 
the measurement of individual differences in eye-blink conditioning.) 

It is more difficult to know what to do with D and F, as these letters 
represent hypothetical variables which are unknown and not specified by 
learning theory; indeed, they may be non-existent. Only the closest and 
most careful study of the particular conditioning measures used in the 
experiment will gradually elucidate all these complexities, mercifully 
hidden in the diagram under innocuous letters and cross-hatched 
figures! 

The discussion will make it clear why the fact that empirically ob- 
served correlations between different conditioning measures have usually 
been low cannot be interpreted as being a fatal objection to the theory 
of the existence of a general factor of ‘conditionability’. The fact that 
they are usually positive should be taken as a spur to purify the measures 
concerned, increase their reliability, and elucidate the true relationships 
existing between the underlying variables. Negative results at such an 
early stage of development are no more conclusive than were the Te 
peated failures for several hundred years to observe stellar parallax. 

When we turn to the relationship between personality variables like 
extraversion, and experimental variables like conditionability, similar 
considerations apply. The restrictions imposed by lack of reliability 
are just as important here as in the case of correlating one measure © 
conditioning with another (Eysenck, 1959). As regards the influence © 
extraneous variables, we may also use the same arguments as before. 
Let us turn back to the figure, letting A, represent introversion this 
time, and B, eye-blink conditioning. The only change we have to make 
is to replace C, by C,, which we now take to represent differential en- 
vironmental influences. Introversion, according to the theory, repre" 
sents the effect of superior conditionability under conditions of equa 
external (social) pressure. Clearly such equality implies validity of the 
theory in what Roby has called a subjunctive sense; ‘that is, it may 
describe the behaviour of entities under conditions that never obtain 
in fact’. We cannot experimentally do much to produce equality, but 
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we can possibly use an estimate or a measure of inequality as a suppres- 
Sor variable in our equations. This is being done at the moment in a 
research being carried out under a grant from the Home Office. The 
purpose of this work is to test the theory that low conditionability 
(extraversion) predetermines children to grow up into less strongly 
Socialized adults, the hypéthesis being that the process of socialization 
1s mediated by conditioning. Now clearly the deduction that criminals 
Should be more extraverted than matched non-criminals would only 
follow if the amount of conditioning received in the direction of sociali- 
zation had been equal for all the subjects of the experiment. An introvert, 
brought up and conditioned in Fagin’s kitchen, would presumably be 
more likely to become a criminal than an extravert brought up under 
Severe disciplinary conditions in an honest home. Thus unless separate 
estimates are available of conditions of upbringing, the correlation 
between criminality and conditionability, or between criminality and 
extraversion, could take on almost any value at all, depending on the 
Selection of cases. 

The same consideration applies to the problem of the relation between 
extraversion-introversion, on the one hand, and hysteria-dysthymia, of 
the other. I have followed Jung in hypothethizing that extraverted per- 
Sonality predisposes a person to develop hysterical and psychopathic 
Symptoms, while introverted personality predisposes a person to develop 
dysthymic symptoms. Such a hypothesis does not postulate a one-to-one 
Correspondence, however; it preserves the distinction between personal- 
ity, on the one hand, and symptomatology on the other. Foulds (1958) 
has argued, quite correctly, that the two should not be identified; this is 
Implicit in my statement of the theory. Anxiety is regarded as a condi- 
tioned fear response, and the hypothesis linking anxiety with introver- 
Sion was elaborated on the basis of postulating greater conditionability 
on the part of introverted people, leading to the ready elaboration of 
Conditioned fear responses. But the development of conditioned re- 
Sponses is determined by other factors than the degree of conditionability 
es an organism; frequency of conditioning trials, stimulus-response 
t Ynehronism, delay in reinforcement, and the nature and strength of 
im Teinforcing agent are other variables which may be vastly more 
og rant in any particular situation. By pleading for the recognition 

this one factor, I did not intend to plead for the neglect of all others! 
wi extraverts do form conditioned responses, after all; wits ed 
uh eg Paribus that they condition less quickly than introverts epe 
usual] experimental conditions (and not always then J E od im 
sponse. par la, and extraverts may therefore form conditione a 
Prais "e While introverts fail to do so. Again, therefore, a realistic k 
of the must include not only conditionability, but also an ^ eem 
tic Gan conditions responsible for the development of the a 

Order in question. It is only when this assessment is used as à 
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suppressor variable in the correlation that we may say that introversion 
= dysthymia, and extraversion = hysteria/psychopathy. It would seem 
to follow from our theory that greater uncertainty should attach to the 
diagnostic classification of neurotics of extraverted personality, as these 
may develop anxiety and other dysthymic reactions under appropriate 
conditions; it is more difficult to imagine circumstances under which 
neurotic introverts would develop hysterical or psychopathic symptoms. 
Foulds’ results confirm this prediction. It is to be hoped that his pioneer 
experiment will be followed up by others, in an attempt to test this 
general hypothesis; the relation between personality and symptom de- 
velopment has been curiously neglected by psychiatrists and psycholo- 
gists alike. 

In summary of this discussion, it might perhaps be fair to repeat that 
when we are dealing with a strong theory, successful predictions are 
commonplace and do not do much to enhance the validity of the theory, 
while failures to predict correctly are very serious and may be disastrous 
for the theory in question. When we are dealing with a weak theory, 
however, the reverse is true. Successful predictions, particularly if un- 
likely on commonsense grounds, are important and valuable, and do 
much to support the right of the theory to be considered seriously. 
Failures, however, are not unequivocal enough to be taken too seriously; 
they certainly do not damage the theory in the same way as do failures 
in the case of a strong theory. 

We may put this point of view in terms of information theory. If we 
may take K as certain (or near certain), then the confirmation Or 10- 
firmation of H gives us one bit of information. (Provided both outcomes 
are equiprobable.) If we take K as quite uncertain, then the failure of 
to be verified may have many causes; its success, however, is unlikely 
to have a cause other than the one specified. Consequently, success gives 
us more information than failure. It gives us information, not only 
about H, but also about K. Success implies that both H and K are 
essentially correct; failure implies that either H, or any part of K may 
be in error, and this is too vague to be very useful. Failure may O 
course be used to suggest more complex relationships, improvement? 
in technique, or limitations in choice of parameters, but it is not in itsel 
very informative. Information gained by successful tests of a strong 
theory is largely redundant, while failure is highly informative. The 
reverse is true of weak theories. 


1 An example from physics may make this statement clearer. There are certain 


stars, the so-called ‘white dwarfs’, to which Eddington’s mass-luminosity principle 
does not apply; they are supposed to be much more massive than might be expecte 
from their luminosity. (This is because in them matter reaches an extraordinary hig 
degree of density at which the ordinary gas laws cease to apply.) Use has been ma E 
of this condition to furnish another proof for Relativity theory. This theory predict 
that the apparent frequency of a periodic phenomenon— such as atomic vibration ^ 
changed when thesourceof the vibration issituated in a strong gravitational field. Th 
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What has been said in this chapter should not of course be taken to 
mean that the psychologist has unbridled licence to theorize to his 
heart’s content, regardless of negative results. For a theory to be re- 
garded seriously it must show some evidence, both internally and ex- 
ternally, that it can mediate experimental predictions. But once this 
evidence is available, earty failures of some predictions should not be 
taken too seriously; they should be regarded as a challenge to discover 
the causes of the failure, rather than as necessarily disproving the theory. 
This, of course, implies a tolerance of ambiguity, an ability to suspend 
Judgement, which is often difficult to maintain. Nevertheless, the whole 
history of science argues powerfully against rash decisions on the 
basis of inadequate data; time and time again we have seen buried 
theories rise afresh from the ashes to which they had been prematurely 
consigned. 

Nor should expectations be too high of close relationships between 
experimental variables. Much refinement will have to go into the raw 
scores obtained from psychological tests before we can begin to claim 
to measure one single variable, rather than a mixture of the most diversi- 
fied traits, abilities and attitudes. The best that can be expected is a set 
of low correlations, usually in the expected direction, but occasionally 
directly Opposed to prediction; on such a foundation we can then begin 
to erect the infinitely complex set of laws and functional relationships, 
Concepts and definitions, which will ultimately, shorn of ambivalences 
and ambiguities, constitute that proper science of behaviour and per- 
Sonality which so obviously does not exist at the present time, except 
Possibly as a foundation for a palimpsest. 


Mowing down of the vibration shows itself in a slight displacement of spectral 
cus towards the red; it was observed in 1925 by Adams in the spectrum of the 
o qPonion of Sirius. Vaucouleurs (1957) points out: *As the observed effect was in 

eu Agreement with the theoretical prediction, both the theory and the existence of 

ly dense matter in white dwarfs were confirmed.’ Here the relativity theory 

Sar being tested in relation to a,body of knowledge (K) not itself too firmly 

comet Confirmation of the hypothesis is likely only if both H, and K = 

theory ; and Consequently contributes much to our knowledge of both — y 

due d and of the nature of white dwarfs. Failure of the hypothesis could have been 

Of white sos in the relativity theory, or to lack of true knowledge of the composition 

ite dwarfs; it would therefore been ambiguous at best, and not very informative. 
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